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1. Introduction. This paper has been written to meet a need which I believe is real. 
The theory of electrical networks involves questions of topology, and electrical engineers 
cannot be expected to be expert topologists. They need only a little, but it is difficult to 
get to know anything at all about topology without prolonged concentrated study, for it 
is a closely-knit subject. Hence the necessity to pull out of the body of topology and 
exhibit in what the mathematician might consider a clumsy popular form, those theorems 
which are of fundamental importance in network theory. 

Of such theorems there appears to be one outstanding, and to it this paper is devoted. 
Without an understanding of this theorem, every electrician who mixes mesh methods 
with branch methods must feel insecure; with an understanding of it, he should be able 
to see the essential simplicity of much that is otherwise obscure and difficult. 

The thought that the present attempt must be made, even by one who is neither 
topologist nor electrical engineer, came to me when reading for review P. Le Corbeiller’s 
book, Matrix Analysis of Electrical Networks (Harvard University Press, 1950). There 
the author avoids the essential topological issues, but it seemed to me that if only those 
issues could be discussed and understood, the whole significance of Kron’s method as 
expounded in the book would stand out more clearly. 

The matter here presented is based on the work of W. H. Ingram and C. M. Cramlet’, 
expanded in some respects to make the argument easier to follow but with omission of all 
that does not seem to bear directly on the fundamental question. 

Before proceeding to the technical arguments, I have inserted in the next section some 
philosophical ideas which may be obvious, but which I think need to be stated with a 
view to better understanding between mathematicians on the one hand and physicists and 
engineers on the other. 

2. The meaning of ‘‘proof”. The modern meaning of the words “mathematical proof” 
is well known: they imply a faultless logical chain which starts from undefined elements 
and axioms, no loop-holes or exceptions permitted. Those subjects which, like topology, 
have been developed in that spirit of rigor, present an almost impenetrable front to 
applied mathematicians or engineers. The extraction of one needed result, and an under- 
standing of what it means, demand a long and careful study in an uncongenial atmosphere, 
unrelieved by interpretations in terms of natural phenomena. 

It is obvious that this state of affairs cannot persist. Each mathematical subject must 
be treated on several levels, varying from that of extreme rigor down to simple intuitive 
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descriptions with no proof at all. We get this variety of treatment in older branches of 
mathematics, partly because the creators had not got the modern standards of rigor, and 
partly because these matters have been looked at so long by so many people and from so 
many different angles. 

Much of the argument that passes for proof in physics and engineering is not proof in 
the mathematical sense, and it is unlikely that the practical subjects will ever submit to 
the strict mathematical discipline. There seems to be an incompatibility between those 
minds which excel in logic and those which are capable of dealing successfully with prob- 
lems suggested by nature. 

The word “proof’’ has such a general usage that it is inconceivable that it should be 
employed only in its strictest mathematical meaning. Physicists and engineers will con- 
tinue to use it in a looser sense, and will regard as proved any proposition with regard to 
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Fic. 1. Network with 3 nodes and 3 branches (N = 3, B = 8). 





Fic. 2. Network with 4 nodes and 6 branches (N = 4, B = 6). 


which they can assemble sufficient evidence to convince them of its truth. As Descartes 
pointed out long ago, to “prove” something by a series of logical steps, and to ‘‘see”’ or 
‘“understand”’ it are not the same thing; and what the physicist or engineer needs is the 
“seeing” and the “understanding’’. 

The type of proof used in the present paper may appear clumsy, longwinded, and 
inaccurate to the pure mathematician. But that does not matter provided the proofs 
fulfil their purpose, which is to carry conviction to those who are willing to accept argu- 
ments with an element of intuition in them, when backed by appeal to a variety of simple 
and complicated special cases. All the proofs of elementary geometry carry conviction in 
this way; every property of the triangle is proved for a specific triangle drawn on a sheet 
of paper or imagined, and there is no guarantee (short of a plunge into the rigor of Hilbert 
or Veblen and Young) that with a different diagram the theorem may not prove false. 

3. The definitions of network theory. The professional electrician is interested in 
physical networks, consisting of wires, generators, and so on—pieces of apparatus that 
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actually work. The topologist is interested in undefined elements and axioms concerning 
them. They both agree that they may play in common a game with pencil marks on a 
sheet of paper, these marks being to the electrician a representation of his apparatus, and 
to the topologist a representation of his undefined elements. 

In speaking of the marks made on the paper, we may use the language of geometry 
(points, curves, etc.) or we may (more suitably for present purposes) use the language of 
the electrician. However, if we take the latter course, we must exercise great care not to 
read into each physical term its full physical meaning. Thus, to understand the line of 
thought at a certain point, we must be prepared to use the word “current”’ without ac- 
cepting as obvious that the ‘currents’ necessarily obey Kirchhoff’s law because all 
physical currents do. It is to avoid confusions of this sort that we have to be somewhat 
careful in the matter of definitions. 

Let us mark some points on our paper with heavy dots; for reference we may letter 
them A, B, C, --- in any order. These are nodes (or terminals or junction-points or ver- 
tices), and in general we shall denote their number by N. Figs. 1-4 show cases where 
N = 3, 4, 13 and 34. 





Fic. 3. Network with 13 nodes and 25 branches (NV = 13, B = 25). 


Next we join the nodes by lines or curves. Every such line or curve has a node at each 
end. On the lines or curves we put arrows indicating sense, one on each of them, the direc- 
tions of these arrows being distributed without any plan. The line or curve, with its 
arrow, is called a branch (or directed branch if we want to emphasize that it has a sense). 
For reference we may attach letters a, b, c, «++ to the branches, again not according to 
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any plan, or we may number them in any order; we shall in general denote the number of 
branches by B. Figs. 1-4 show cases where B = 3, 6, 25 and 53. 

It is understood that, though two branches may intersect in the diagram, it is for- 
bidden to pass from one branch to another by means of such an intersection. Such a pass- 
age can be made only through a node, and we must be careful to distinguish nodes from 




















Fic. 4. Network with 34 nodes and 53 branches (NV = 34, B = 53). 


intersections by marking the former with heavy dots. This is a trivial nuisance due to the 
use of a representation on a sheet of paper; it would not arise if we used, as we might, a 
network of strings in space with knots at the nodes. 

Any collection of nodes and branches is a network. 

The nodes are the ends of branches. In Figs. 1-4 no case is shown where a node is the 
end of just one branch, although this is allowed by the definition. Such a case appears of 
little interest from an electrical standpoint, for no current can flow along such a branch. 
But the idea of a node at the end of a single branch will become important when we discuss 
trees in the next section. 

There is another way in which we have exceeded our instructions in making the net- 
works shown in Figs. 1-4. They are connected in the sense that one can travel along 
branches from any one node to any other. From a topological standpoint, any network 
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consisting of two (or more) disconnected parts is to be regarded as two (or more) net- 
works. There may be electromagnetic connection between otherwise disconnected parts, 
but that will not concern us until Section 11 where the argument ceases to be purely 
topological. For the present we are to think only of connected networks. 

We now define a mesh (or circuit) by the following prescription. Starting from a node, 
traverse branches continuously, observing the following rules: 

(i) having started to traverse a branch at one end, continue to the other end; 
(ii) when you arrive at a node, leave it by a branch other than that by which you 
arrived, if there is another branch; 

(iii) if you arrive at a node which terminates only one branch (that by which you 

arrived), stop the operation and start all over again. 
If in the course of such an operation, you meet the same node (say A) twice, the set of 
branches you have traversed from A to A form a mesh. 

To each mesh we assign a sense (one of two possible senses), without regard to the 
senses already assigned to the branches which compose the mesh. In Fig. 2 ebda and bead 
are the same mesh, taken in opposite senses. 

4. Trees. A tree is a network of a particular type, namely a connected network con- 
taining no meshes. It is therefore a set of nodes and connecting branches, as shown in 


— 


Figs. 5, 6, 7. 


Fig. 5. Tree with one branch (VN = 2, T = 1;N = T +1). 


Fic. 6. Tree with 3 branches (VN = 4, T = 3; N = T + 1). 


We need certain facts about trees, and these we shall set down as theorems: 
THEOREM I: Every tree has at least one node which ts an end of only one branch of the tree. 


This is certainly true of the trees shown, and is easy to prove in general. Simply put 
your pencil anywhere on a branch of the tree and start moving it along the branches ob- 
serving the traffic rules laid down in connection with the definition of a mesh. Since there 
are no meshes in a tree, you cannot get back to the point from which you started. The 
number of nodes is finite, and so your journey must end. It can end only at a node which 
is the end of a single branch—the branch by which you have approached it. This proves 
the theorem. 

Since, from any given point on a branch, you have a choice of two directions in which 
to set out, a tree must contain at least two nodes each of which is the end of a single 
branch. But the theorem as stated is all we need for our purposes. 
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THEOREM II: Jn a given connected network it is possible to construct a tree containing all the 
nodes of the network. 

Figs. 8-11 show trees constructed in the networks of Figs. 1-4, the branches of the 
trees being shown by heavy wiggly lines in Figs. 8-10 and in a different way in Fig. 11. 


Fic. 7. Tree with 20 branches (N = 21, 7’ = 20; N = T + 1). 


Fic. 8. Tree (heavy wiggly lines) in network of Fig. 1. 
2 branches-in-tree; 1 branch-out-of-tree; 1 basic mesh 


(N=3,B=3,7T=2,M=B—-T=#1;M+WN =B +1). 


In manuscript a red pencil is best. Further experimentation with networks of his own 
contriving should convince the reader of the truth of the theorem, but here is a proof. 

Start with any branch of the network. It is either a branch of some mesh or it is not. 
(In Figs. 1-4, it must be a branch of a mesh.) If it is a branch of a mesh, remove the 
branch; its removal cannot make the network disconnected. If it is not a branch of a mesh, 
leave it. Testing each branch in this way, and removing it if it belongs to a mesh, we are 
left at each stage with a connected network. Finally all the meshes will be destroyed, and 
we shall have a connected network with no meshes, that is, a tree. Since we removed no 
nodes, the theorem is proved. 

This proof makes the construction of a tree seem rather more complicated than it 
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really is. Set a red pencil on a node, and draw it along a branch to another node. Continue 
step by step, rejecting beforehand the reddening of any branch which would complete a 


mesh. A tree will result. 





Fic. 9. Tree (heavy wiggly lines) in network of Fig. 2. 
branches-in-tree; 3 branches-out-of-tree; 3 basic meshes 


1B=67=3,M=B-T=3;M+N=B+1). 


> 


3 
(N = 








Fic. 10. Tree (heavy wiggly lines) in network of Fig. 3. 
12 branches-in-tree; 13 branches-out-of-tree; 13 basic meshes 
=12,M=B-—-T=13;M+N =B +1). 


(VN = 13,8 = 25,T = 
If T is the number of branches of a tree and N the number of nodes, then 


N = T + 1. 


[his is easy to see. For we can build up any tree by starting with a branch and the two 


(4.1) 
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nodes at its ends, and then adding one branch and one node (at the far end of the added 
-=2,T =1,N = T +1, and since in 


branch) at each step. When we start we have A 
each step we add unity to 7 and unity to N, the relation (4.1) holds at any stage, and so 


for the completed tree. 
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Fic. 11. Tree (tree in full lines, rest of network dotted) in network of Fig. 4 
33 branches-in-tree; 20 branches-out-of-tree; 20 basic meshes 
(N = 34,B = 53,T =338,M=B—-—-T=20;M+N =B +1). 


If we build up a general connected network like this, we might sometimes add branches 
(This does not occur for a tree, for such branches would create 


without adding nodes 
meshes, and these a tree, by definition, must not have.) Thus for any connected network 
(4.2) 


NSB+1. 


we have 
5. Branch currents. So far we have dealt with nodes and branches, and meshes 


formed of branches. 
With a branch we now associate a number 7, in general complex. (Note that 7 is a com- 
plex number like 3 + 47, where 7 = (—1)'”.) 

We recall that each branch has an arrow on it, pointing from one node (say A) to 
another node (say B). If 7 is the current in the branch, we say that a current 7 flows into 
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the node B, and a current —7 flows into the node A. (Equivalently we might say that a 
current —z flows out of B and a current 7 flows out of A.) 

So far the branch currents in a network are to be regarded as a set of B completely 
arbitrary complex numbers. The whole set may be represented by a matrix i with B rows 
and one column. 

6. Mesh currents. With a mesh of the network we associate a complex number 7’, 
which we call a mesh current. 

So far branch currents and mesh currents are separate things. We now set up a con- 
nection between them, so that we can say that a mesh current generates certain branch 
currents. 

Consider any mesh. It contains certain branches, some agreeing in sense with the mesh 
and some disagreeing. Let us put in a mesh current 7’. We say (as a matter of definition) 
that this mesh current generates in each branch of the mesh a branch current 7’ if the senses 
agree and a branch current —1' if the senses disagree; it generates no branch current in a 
branch not contained in the mesh. 

Thus in Fig. 2 a mesh current 2’ in the mesh adc generates branch currents 2’, 7’, —7’ in 
a, d, c respectively. 

Suppose now that we assign mesh currents in a set of meshes. We say that this set of 
mesh currents generates a set of branch currents obtained by adding together the branch 
currents generated by the several mesh currents in accordance with the rule given above. 

Thus if in Fig. 2 we put in mesh currents as follows: 


7; inebf, 
73 in ebe, 
73 in daeb, 


these generate the following branch currents: 


i_{=-—i, 
i=-t -§-«, 
i= —t, 
i= —-t, 


Latest, 

ty = —4 . 
Let us now, in any network, take R meshes and number them 1, 2, --- 2. We number 
the branches 1, 2, --- B. In the meshes we put mesh currents 7; , 23 , --- t% . Then it is 


clear, from the way in which we have defined the generation of branch currents from mesh 
currents, that the whole system of branch currents generated by the above mesh currents 


may be expressed by the formula 


ty = y Cte (p = 1, oe B) (6.1) 
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where the coefficients are numbers defined as follows: 


\ = 1 if branch p is contained in mesh gq and has the same sense, 
Co. ) = —1 if branch p is contained in mesh q and has the opposite sense, (6.2) 
= 0 if branch p is not contained in mesh q. 
We may write (6.1) in matrix notation: 
; = Cy’. (6.3) 


Note that the meshes employed here are completely arbitrary. 

7. Kirchhoff’s node law.* Kirchhoff’s node law states that the sum of all branch 
currents flowing into any node is zero. 

If we assign an arbitrary set of branch currents, they will not in general satisfy this 
law. Indeed, until we examine the question, we cannot be sure that, for a given network, 
we can find any set of branch currents to satisfy the law (except, of course, the trivial set 
of zero currents). 

Consider, however, the branch currents generated by a single mesh current. It is clear 
that Kirchhoff’s node law is satisfied at each node connecting branches of the mesh, and of 
course at all other nodes (0 = 0). The law remains satisfied if we superimpose any number 
of mesh currents, and so we have this result: 


THEOREM III: The branch currents generated by any set of mesh currents satisfy Kirchhoff’s 


node law at eve ry node. 


8. Basic meshes. In any given connected network, draw a tree containing all the 
nodes. All branches of the network then fall into two classes: 
(i) Branches-in-tree. 
(ii) Branches-out-of-tree (also called chords). 
The theorem which we wish to prove is this: 


THEOREM IV: Assigning arbitrary branch currents to the branches-out-of-tree, we can assign 
(and the assignmeni is unique) branch currenis to the branches-in-tree so that Kirchhoff’s node 


law is satisfied at every node. 


We saw in Theorem I that the tree has a node which is an end of only one branch-in- 
tree. Kirchhoff’s node law may be satisfied at that node by assigning a suitable branch 
current to the branch-in-tree, and of course this current is uniquely determined by the 
known currents in the branches-out-of-tree. 

Now regard the branch-in-tree with which we have been dealing as a branch-out-of- 
tree, with the branch current we have just found. This leaves us with a reduced tree, and 
all currents in branches-out-of-tree assigned. Treat this reduced tree in the same way. 
Step by step, we convert branches-in-tree into branches-out-of tree, each with an assigned 
current, and Kirchhoff’s law is satisfied at every node which we drop from the tree. Ulti- 
mately we reach a stage where the tree consists of a single branch. 

At each end of this single branch-in-tree, Kirchhoff’s law determines a current for it. 
For a moment it may appear that our method might break down here—the two ends of 

*As there is a possible confusion (Ingram and Cramlet, loc. cit.) as to which should be called Kirch- 
hoff’s “‘first’’ Jaw and which his “second” law, the unambiguous words “node”’ and “‘mesh’’ will be used 


here, 
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the branch might demand different currents. But that cannot occur. When we started, the 
total current flowing into the tree at all the nodes was zero, since each branch-out-of-tree 
made equal and opposite contributions at its two ends. This zero total flow is preserved 
each time we change a branch-in-tree into a branch-out-of-tree, and so when we have 
reduced the tree to a single branch, the flows into its two ends are equal and opposite in 
sign. Thus we can get rid of the last branch of the tree, turning it into a branch-out-of-tree 
with a uniquely assigned current. Then Kirchhoff’s node law is satisfied at every node and 
the theorem is proved. 

For any connected network we now define basic meshes, using a tree in the network, 
containing all the nodes. This is very simple. Each branch-out-of-tree has its ends con- 
nected by a unique set of branches-in-tree (necessarily unique, since otherwise the tree 
would contain a mesh), and together these form a mesh; to it we assign the sense of the 
branch-out-of-tree contained in it. Thus, if there are M branches-out-of-tree, there are M 
meshes defined in this way, each mesh containing only one branch-out-of-tree, and each 
branch-out-of-tree being contained in just one mesh. We call this a set of basic meshes of 
the given network. Once the tree has been chosen, the basic meshes are uniquely deter- 
mined; but since in general there will be several trees which may be used, so there are several 
sets of basic meshes. 

The total number of nodes in the network being N, the number of branches-in-tree is, 


by (4.1), 7 = N — 1. The total number of branches in the network being B, the number 
of branches-out-of-tree (the same as the number of basic meshes) is thus 
M=B-T=B-N+1, (8.1) 


or equivalently 


M+N=B+1. (8.2) 


This general topological relation for a connected network is a useful check in the case of 
complicated networks as in Figs. 10 and 11. 

Although topology (and, more generally, modern geometry) is a severe logical disci- 
pline, it seems right to use the word “‘topology”’ as we use the word “‘geometry”’ to describe 
properties of systems of dots and curves drawn on paper, or a net of knotted strings, or 
even an electrical network. In speaking of an electrical network, we try to clear our 
thoughts by separating its ‘“topological’”’ properties from its ‘‘electrical’’ properties. As 
long as the properties of which we speak are properties shared by systems of dots and 
curves and knotted strings, then we may say that we are in the topological domain. It is 
when we start connecting emfs and currents that we pass into electrical theory proper. 

In this sense, then, almost all that has been done up to this point, and indeed almost 
all until we reach Section 11, is topology. The mesh and the tree belong to topology, and 
so do the basic meshes, since they are derived directly from the tree. This is not a matter 
for controversy regarding rival spheres of influence. It is an attempt to assist understand- 
ing by analysing and classifying the elements which, taken all together and without 
analysis and classification, can be bewilderingly confused. 

9. Kron’s transformation matrix. Suppose we have a connected network before us. 
We pick out a tree, as we always can, and hence a set of basic meshes. Let there be M 
basic meshes. We number the branches 1, 2, --- B and the basic meshes 1, 2, --- 47, with 
no attention to any particular order. 

To the basic meshes we assign arbitrary mesh currents ¢] , 7, «++ t4,. Now (6.1) applies 
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for any meshes, and so we can express the branch currents generated by the currents in 
the basic meshes by the formula 


M 
ie ha 
be Cate ’ (p = 1, 2, meee B) (9.1) 
q=1 
where 
a So 1 if branch p is contained in mesh g and has the same sense, 
= —1 if branch p is contained in mesh gq and has the opposite sense, (9.2) 
eat 3 ) if branch p is not contained in mesh gq. 
In matrix form we have 
= Ci’; (9.3) 


here the B & M matrix C is Kron’s transformation matriz. 

We note that, by Theorem III, if the mesh currents are chosen arbitrarily, the branch 
currents given by (9.1) or (9.3) necessarily satisfy Kirchhoff’s node law at every node. 

But there is another important thing which we have to prove, namely, that by proper 
choice of the mesh currents i’ (9.1) or (9.3) will yield the most general set of branch 
currents i consistent with Kirchhoff’s node law. 

To prove this, let us assign any set of branch currents consistent with Kirchhoff’s 
node law. Then consider the tree in the network which corresponds to the basic meshes 
involved, and think of the branches-in-tree and the branches-out-of-tree. 

Each basic mesh contains just one branch-out-of-tree, and the branch current in it has 
been assigned. To the mesh containing that branch assign a mesh current equal to that 
branch current. Having done this for all the basic meshes, we have a set of mesh currents 
which generates a set of branch currents. We have deliberately made this set of branch 
currents coincide with the assigned branch currents in the branches-out-of-tree, and the 
coincidence for the branches-in-tree (and hence for all branches) follows from Theorem 
IV. Thus the assigned branch currents, arbitrary except for the satisfaction of Kirchhoff’s 
node law, can be generated by a suitably chosen set of mesh currents in the basic meshes. 

Let us summarise this result. It is the central point; what has gone before was prepara- 
tion for it, and what follows is easy deduction. It is the central theorem involved in Kron’s 
formula (9.3), or indeed in any mesh-branch transition in network theory. This is the 


theorem: 


THEOREM V: Given a network, there exists at least one set of basic meshes such that the following 
statements are true, the matrix C being defined as in (9.2): 
(i) If a set of mesh currents i’ is arbitrarily assigned, the branch currents given by i = Ci’ 
satisfied Kirchhoff’s node law at every node. 
(ii) If a set of branch currents i is assigned, arbitrarily except for the satisfaction of 
Kirchhoff’s node law at every node, there exists a set of mesh currents i’ in the basic 
meshes such that i = Ci’ 


10. Branch emfs and mesh emfs. For present purposes a branch emf e is simply a 
complex number associated with a branch of a network, and a mesh emf e’ is a complex 
number associated with a mesh. 

We bring these two concepts into relation with one another in a manner complemen- 
tary to that in which we brought together branch currents and mesh currents. We say 
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that if branch emfs are assigned in all the branches, they generate in any mesh a mesh emf 
equal to the sum of the branch emfs in the branches contained in the mesh, a + or — sign being 
prefixed to the branch emf before adding according as the senses of the branch and the mesh 
agree or disagree. 

If, as usual, we number the branches 1, 2, --- B and the meshes 1, 2, --- M (we take 
a basic set, although the immediate statement holds more generally), then the above 
verbal statement is obviously equivalent to the following formula conneeting mesh emfs 
e’ and branch emfs e: 


B 


ef ” ya Defy (q = 1, 2, ech M), (10.1) 


p=1 
where the coefficients are defined by 


{ = ] if branch p= is contained in mesh qg and has the same sense, 
D., { = —1 if branch p is contained in mesh g and has the opposite sense, (10.2) 
0 if branch p is not contained in mesh gq. 


When we compare this with (9.2), we see at once that D,, = C,, , or in matrix lan- 
guage D = C, where C, is the transpose of C. Thus (10.1) may be written 


lI 


B 
€ = } Cree» ’ (10.3) 
p=1 
or 
e’ =C,e- (10.4) 


To sum up: 


THEOREM VI: Given a network and a basic set of meshes in it, any assigned set of branch emfs 
e generate mesh emfs e’ according to the formula (10.4), where C, is the transpose of Kron’s 


transformation matriz. Hl 


We have then at this stage obtained, with it is hoped a proper understanding of their 
meanings, the two formulae 


i = C7’, e’ = C.e. (10.5) 


The currents and emfs have appeared so far only as numbers associated with branches and 
meshes, without electrical significance except in connection with Kirchhoff’s node law 
and the generation rules connecting quantities for branches and meshes. There has been 
no reference at all to an impedance matrix. 

We have been thinking of a single connected network. If we have a set of disconnected 
networks, the formulae (10.5) hold for each of them, and it is easy to see (such is the 
elasticity of matrix notation) that the whole set of such relations, a pair for each con- 
nected network, may be compressed into a pair of formulae of precisely the form (10.5), 
now covering the whole network. 

This ends what may be called the purely topological part of the paper. 

11. How to solve a network. Suppose we have a network, in general consisting of 
several parts, disconnected topologically but connected electromagnetically. Let the 
branches (B in number) contain generators of emf, d.c. or a.c. Asquare B X B impedance 
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matrix Z is given. The problem of solving the network consists in finding the branch 
currents 7, given the branch emfs e, only steady states being considered.* 

We cannot of course attempt to do this without some law connecting currents and 
emfs. To state this law, we define a set of B potential-differences W in the branches by the 
formula 

W =e — Zi. (11.1) 
We then accept Kirchhoff’s mesh law, which asserts that the swm of the potential differences 
for the branches of any mesh ts zero, each potential-difference being prefixed before adding with 
a + or — sign according as the senses of the branch and the mesh agree or disagree. 

This holds for any mesh, and we could continue our discussion for a while using any 
meshes at all. But to avoid unnecessary generality, let us apply Kirchhoff’s mesh law to a 
set of basic meshes. It is clear that the verbal statement made above is precisely equiva- 
lent to 


B 
> DW, = 0, (11.2) 


p=1 
where D.,,, is defined by (10.2). Thus Kirchhoff’s mesh law gives 


C.W = 0, 11.3) 


since, as we saw, D is the transpose of C. 
We now do some formal work with matrices. Multiplication of (11.1) on the left by C, 


gives 
C,e — C,Zi = 0, (11.4) 
and, by (10.5), this may be written 
e’ = Zi’, 11.5) 
where Z’ is defined to be 
Zz = CZC. (11.6) 


We have not achieved our objective; we have not found the branch currents i in terms 
of the branch emfs e. We have got no further than (11.5) which expresses mesh emfs in 
terms of mesh currents. And we can go no further without an additional assumption which 
will assure us that the matrix Z’ is not singular. 

12. The final step. The final assumption which we might make is that the network is 
dissipative, i.e. it contains resistances. We know then that if all emfs are put equal to zero, 
there cannot exist steady d.c. ora.c. currents, except of course zero currents. In the case of 
a.c. we can be more general, and say that the network may be resistanceless, but that the 
frequencies which we consider do not include resonant frequencies of the network; then 
in the absence of all emfs there can exist no periodic currents in the network, except of 
course the trivial zero currents. 

The essential assumption is that the network (and perhaps the frequencies considered) 
should be such that e = 0 implies i = 0. This is the assumption we shall make. 


* In the case of a. c.,i and e are complex constants from which the physical quantities are obtained 


by multiplying by exp (jpt) and then taking the real part. 
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Consider now the equation (11.5). The matrix Z’, as given by (11.6), is in part topo- 
logical (through C) and in part electromagnetic (through Z); but it is defined by the 
structure of the network, and is independent of the values of the branch emfs. 

Let us then put e = 0. This implies, under our assumption, i = 0. Going back to the 
tree from which the basic meshes were constructed in Section 8, we see that each mesh 
current is actually a branch current, namely, in a branch-out-of-tree. Hence i = 0 implies 
i’ = 0. Now e = 0 certainly implies e’ = 0 by (10.5). We conclude from (11.5) that the 


equations 
Zi’ = 0 (12.1) 
are satisfied only by i’ = Q. 
This means that the determinant of the matrix Z’ is not zero; the matrix is therefore 


non-singular, and Z’~ exists. 
The rest is simple. Restoring general values to e, we multiply (11.5) on the left by 


Z’ and get 


i’ = Z’~"e’. (12.2) 
Then we multiply on the left by C and use both the equations (10.5): this gives 
i = CZ’"'C,e. (12.3) 


or, inserting the value of Z’ from (11.6), 
i = C(C,ZC)'C,e, (12.4) 


which is Kron’s fundamental equation for the solution of networks. 
13. Conclusion. The essential steps leading ultimately to (12.4) are as follows: 
(a) Deduction of i = Ci’; this is topology, oriented by Kirchhoff’s node law. 
(b) Deduction of e’ = C,e; very simple when we have done (a). 
c) Deduction of e’ = Z’i’ from Kirchhoff’s mesh law. 
d) An assumption assuring the non-singular character of Z’, so that we can solve for i. 
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STUDIES ON TWO-DIMENSIONAL TRANSONIC FLOWS OF 
COMPRESSIBLE FLUID.—PART III* 


BY 
S. TOMOTIKA ano K. TAMADA 
University of Kyoto, Japan 


13. The introduction of a second new hypothetical gas. In the foregoing sections, we 
have investigated some transonic fields of flow by employing a hypothetical gas which 
closely approximates the real gas during an isentropic flow in the transonic range. 
Although such a hypothetical gas has a merit that the fundamental equation governing 
its flow assumes a rather simple form and can be solved exactly in several cases, it has 
the drawback that it can approximate the real gas obeying the isentropic law only for 
a limited (transonic) range of velocities. Thus, the method of analysis as developed in 
the foregoing sections has a rather narrow application; it can be applied only to nearly 
uniform transonic flows. 

In Part III, an attempt is made to develop a theory which is applicable even to a 
transonic flow containing limited supersonic regions as well as stagnation points. For 
this purpose, we have introduced a second hypothetical gas which is capable of repre- 
senting the real gas subject to the isentropic law with a better degree of approximation 
than that used in Parts I and II. 

For the sake of convenience, we start from the linearized equations of motion in the 
hodograph plane which are valid for any compressible perfect fluid. They can be written 


in the following forms: 


Yo = —XW, 
(13.1) 


Yo = Vw ’ 


where, as before, the coefficient X and independent variable w are respectively given by 
. riiy 7/1 1 
re -£(L) (4-3) 
p \pq p\q ¢ 


“p 
w= [ = dq. 
q°! 


“1 


(13.2) 


Eliminating ¢g from Eqs. (13.1), we obtain the fundamental equation for determining 


y in the form: 
Vow + X Woo = 0. (13.3) 


The coefficient X in the second term can evidently be expressed as a function of w alone 
by the use of the known equation of state of any gas together with Bernoulli’s theorem. 
The dotted-line curve in Fig. 20 shows the behaviour of X(w) for the case of the real 
gas obeying the isentropic law, the value of y having been taken equal to 1.4 for air. 

It is in general difficult to obtain exact solutions of the fundamental equation (13.3) 





*Received March 14, 1949. Parts I and II of this paper appeared in this Quarterly January (1950) 
and July (1950), respectively. 
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with the coefficient X(w) for the case of the real gas. It is therefore suggested that the 
function X(w) for the case of the real gas may be replaced by a simple approximate 
function in order to reduce Eq. (13.3) to a tractable form. The most simple replacement 
of X(w) by a suitable constant (as done by Chaplygin, Karman and Tsien), however, 
is not appropriate for the investigation of transonic flow. In Parts I and II, we have 
replaced the function X(w) for the case of the real gas by the tangent, as shown by the 
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chain-line curve in Fig. 20, at the point w = 0 (i.e. at the critical state g = c = 1) for 
the purpose of dealing with nearly uniform transonic flow. In order to treat transonic fields 
of flow containing stagnation points, however, it is necessary to make use of another 
approximate function which can approximate more precisely the function X(w) for the 
real gas. 

In the following lines, we shall use a function of the form: 


X(w) = a(1 — be**”), (13.4) 


where a, b and «x are constants to be determined adequately. As will be seen presently, 
the use of this function makes our fundamental equation (13.3) tractable. 

We have conveniently determined the values of the constants a, b and « in (13.4) 
in such a way that the curve of X(w) as given by (13.4) coincides with the corresponding 
curve for the real gas to the order of their tangents at w = — © (which corresponds to 
the stagnation point) as well as at w = 0 (which corresponds to the critical state). 
The expression for X(w) thus determined is given by 


X(w) = a(1 — e**”), 


( y) ie aoe 
OW +i Aas UE 


The full-line curve in Fig. 20 shows the curve of X(w) given by (13.5), by taking, as 
before, y = 1.4 for air. It will readily be observed that this curve can satisfactorily ap- 


with (13.5) 
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proximate the curve of X(w) for the real gas which is shown by the dotted curve in the 
figure. 

Now, as mentioned already, the form of the function X(q) defined by (13.2) depends 
upon the equation of state p(q) of the gas concerned. Conversely, any given expression 
of this function X(qg) determines the equation of state of a corresponding gas. Thus, 
by introducing, in the following analysis, a second new hypothetical gas, we take the 
function defined by (13.5) as an exact relation valid for such a hypothetical gas, instead 
of considering it as an approximation to the corresponding function for the real gas 
obeying the isentropic law, and we shall deal, in an exact manner, with the field of flow 
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of such a hypothetical gas, as done with the previous hypothetical gas in Parts I and II. 
If we insert the expression for X(w) given by (13.5) into Eq. (13.3), we obtain the 
fundamental equation for determining the flow of our new hypothetical gas in the form. 


You + al — er") Woe = 0, 


2 2/(y-1) y 4. 1 (7¥+1)/(7-1) 
“A ta 
Before proceeding further, we shall now discuss briefly the properties of our hypo- 
thetical gas. If we combine (13.2) with (13.5), we obtain the differential equation for 
determining the equation of state p(q) of our new hypothetical gas. We thus have 


meer (E. an 


(13.6) 


with 
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Solving this equation, by the method of numerical integration or otherwise, under the 
conditions that p = 1 and p’ = —1 at g = 1, we have obtained the curve of p(q) for 
our hypothetical gas as shown by a full-line curve in Fig. 21. It will be seen that this 
curve of p(g) coincides with the corresponding curve, shown by a dotted-line curve, for 
the real gas obeying the isentropic law, to the order of tangent at gq = 0 and to the order 
of curvature at g = 1. In this figure, the curves of r(q) for both our hypothetical gas 
and the real gas subject to the isentropic law are also given by a full-line curve and a 
dotted-line curve respectively, where 7 = e*” is a new variable to be introduced in the 
following section. 

14. A method of solving the fundamental equation (13.6). We now introduce new 
independent variables 7, 6 defined as: 


kw - p 
e = eX , “d ) 
€ € xp (. [ q C q ’ 


al | 


Il 


. 
(14.1) 

at K 
Va 


Then, the fundamental equation (13.6) for determining the flow of our hypothetical 





B 6. 


gas takes the form: 

Wer + tHe + (1 — 7°) vgs = 0. (14.2) 
It is evident that just as the fundamental equation for the isentropic flow of the real 
gas does, this equation (14.2) changes from the elliptic to the hyperbolic type according 
ast < 1 (i.e.qg < 1) or7 > 1 (i.e. g > 1), i.e. according to whether the flow is subsonic 


or supersonic. 
The characteristic curves of Eq. (14.2) are given by the equations: 


a oy ag 
+(8 — B®) = Wr —1-—cos'-, (14.3) 
T 
where {, is an arbitrary parameter. 
To solve Eq. (14.2), we first assume that 
y = T(ne*”, (14.4) 
with an arbitrary constant n. Then, we obtain an ordinary differential equation for 
determining the function 7'(7) in the form: 
aT dT 2 = “ 
Pogtr——n(1—7)T=0. (14.5) 
dr dr 
The general solution of this equation can be expressed in terms of Bessel functions. 
Thus, 
T(r) = AJ,(nt) + BY,(n7z), 
where A and B are arbitrary constants. Hence, the solution of (14.2) which is finite at 
7 = 0 can be expressed in the form: 


y= > A,J,(nze™, (14.6) 


n=0 


where 7 and A,’s are arbitrary constants. 
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If now we make use of the integral representation of J,(nr) of the Bessel type, 
namely’: 


J,(nr) = aif {¢t-' exp [r(t — 1/9/24)" 4 ; 
(0+) 


the expression for y becomes 
-¢ > A,{t"' exp [r(t — 1/0)/2 — ia)" 2 . 


~ Dei J (0+) n=0 


(14.7) 


Since, however, the series in the integrand: 


, A,{t™ exp [r(t — 1/t)/2 — 7g]}" 

n=0 
is evidently the expansion of a certain function in powers of t”' exp [r(t — 1/t)/2 — 76], 
the above solution can be generalized in the following form: 


_. 
ves] FOS 

with (14.8) 
Z = t"' exp [r(t — 1/t)/2 — *p], 
where F(Z) denotes an arbitrary function of the variable Z, and the path of integration 
C should be so chosen that this expression for y may really become the solution of 
Eq. (14.2). 

In fact, inserting the above expression (14.8) for y into the left-hand side of Eq. 
(14.2), which is conveniently denoted by D(W), we have 


Doo) = ae [ae LA5(¢+ 1) + ede] 


Thus, it will be seen that in order that the function y given by (14.8) becomes in effect 
the solution of Eq. (14.2), ie. Diy) = 0, the function: 


A= (E (1+ 2) + ih zo (Z = t' exp [r(t — 1/t)/2 — «p)) 


2 


must take the same value at the two end-points of the path of integration C and that 
in the case where the path C is a closed curve, it is sufficient that dF/dZ should be 
one-valued along C. 

15. A few fundamental solutions of equation (14.2). In this section, a few funda- 
mental solutions will be constructed by applying the method explained above. 

In the first place, we shall consider a solution which is obtained by taking the arbi- 
trary constant A, in the general solution (14.6) to be equal to 1/A", where A is a certain 
positive constant. By comparison with the results obtained in §§8, 9 and 10 of Part II, 
it is expected that such a solution will have a branch-point of order —1/2 at a certain 
point in the hodograph plane, the position of the singular point depending however 
upon the value of the parameter X. 


1G. N. Watson, A treatise on the theory of Bessel functions, Cambridge University Press, 1922, p. 20. 
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Since A, = 1/X’, the function F in (14.8) becomes 
: l 
| i aon 
[=< 
with (15.1) 
T l . 
Z = (1 _ i) — log t — i8 — log X. 
Thus, if we denote the solution under consideration by ¥;-1/2) , we have 
YF dt se 
V(-1/2) ved | eat ew (15.2) 
477 Jc t(1 — e) 


where C is a path of integration to be so chosen that this ¥-,,2) actually becomes the 
solution of the fundamental equation (14.2). 
Let ¢ = t, bea root of the equation z(t) = 0.* Then, in the neighbourhood of the point 
t = ¢, , the integrand in (15.2) can be expanded in the following form: 
] ] ] 
a Gene wees oh, (IY 
t(1 _— e*) t,(dz | t oe to ); 
and therefore it will be seen that the integrand has a pole of the first order at the point 
t = t) , provided that (dz/dt),.,, ¥ 0. 
It is readily found that if we take, as the path of integration C in (15.2), a small 
closed curve enclosing only the point ¢t = ¢ but not any other singular points, the func- 


tion ¥,_1/2) given by (15.2) actually becomes the solution of the fundamental equation 
(14.2). The value of the integral can then be obtained by evaluating the residue of 
the integrand at the point ¢ = ¢, and thus, taking the integral along the path of inte- 


gration in the positive sense, we have 
ee a Te 
ie to(dz/dt),-1, 1 —(r/2)(to + 1/to) 
If we here put ¢, = e'* for the sake of convenience, the expression for ¥,_1,2) becomes 
ultimately 
1 g of 
YVi-1/2) = ‘3 Sy (15.3") 
— T COSW 
On the other hand, since t = t) has been assumed to be a zero-point of the function 
z(t) as defined by (15.1), we have z(t,) = 0, and when use is made of the above sub- 
stitution tf, = e’*, this equation becomes 


B—ilog\ + 7rsinw —w = 0. (15.4) 
By eliminating w from the above two equations (15.3) and (15.4), we can express 


¥.-1/2) in terms of 7 and 8. 


*It may be remarked here that this equation has in general two different roots. 
It can be ascertained without difficulty that this y~1/2) really becomes a solution of the fundamental 


equation (14.2). 
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For a special set of values r.. , 8. of the independent variables 7, 8, the two equations: 
B —itlogvX\ + rsinw —w = 0, 
(15.5) 
1 — rcosw = 0 
are satisfied simultaneously,‘ and the function ¥;~1/2) as given by (15.3) becomes in- 
finite. Thus, it is found that as has been expected from the outset, the solution 4-1/2) 
has a branch-point of order —1/2 at the point (7. , 8.) in the 7, 6 plane. 
Elimination of w from the two equations in (15.5) gives the equation for determining 
7. and #.. in the form: 


Bo + {log d + V1 — 72 + cosh” = 


} = 0. (15.6) 


In case when 7. S 1, both 1 — r2 and cosh” (1/7..) are real and therefore, separat- 
, 


ing the real and imaginary parts, we have 


- 1 (15.7) 

—log Xk + V1 — 12 F cosh" ti 0. 
From these equations it will be seen that the singular point of the solution y¥,-;,2) in 
the 7, 8 plane is, in this case, situated at an isolated point r = 7. on the axis 8 = 0. 
The second equations in (15.7) give the relationship between the coordinate r.. and the 
parameter \. The curve of r.. plotted against is shown in Fig. 22, from which it is 


tT | 
ab SSRa 
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readily found that in order to make r.. to have any value in the range from 0 to 1, it is 
quite sufficient to take for \ only a value in the restricted range from 0 to 1. 
Further, for the hypothetical gas under consideration, we have, from (13.2), (13.5) 


and (14.1), 
*f1 1 
f(s —_ 1) _ a(r’ — 1). 


‘It is readily found that this corresponds to the case when (z);-+, = 0 and (dz/dt)..:, = 0, i.e. when 
the two roots of the equation z(t) = 0 coincide with each other. 
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Therefore, the Mach number M associated with the state at the singular point r = 
To , 8 = Ois calculated by the formula: 


M = “a = V1 + ap2(r2 — 1). (15.8) 
Next, we shall consider the case when r. = 1. In this case, both 7/1 — 72 and 


3 . . . . . . 
cosh” (1/7.) are purely imaginary and therefore, by separating the real and imaginary 
parts on the right-hand side of Eq. (15.6), we have the following two equations: 


log \ = 0, 
Soc ee 1 (15.9) 
Bo F Wrz — 1+ cos’ — = 0. 
To 


Hence, it will be seen that in the supersonic region where r = 1, the singularity occurs 
only at a particular value X,,,. = 1 of the parameter \, and this time the solution ¥,_,,2) 
becomes infinite, not at an isolated point but along two curves in the 7, 8 plane which 
satisfy the second equations in (15.9). 

Further, by comparing (15.9) with (14.3), it is found that these curves of singularity 
are nothing less than the two characteristic curves passing through the point + = 1, 
B = 0. 

Thus, summarizing the above results we see that when the parameter \ assumes a 
value in the range 0 < \ < 1, the singularity of the solution ¥,_,,2) in the hodograph 
plane (i.e., the 7, 8 plane) occurs at an isolated point on the axis 8 = 0, but in the limit 
A — 1, the singularity is prolonged along the two characteristics passing through the 
point 7 = 1, 8 = O. This remarkable characteristic change of the singularity of the 
solution occurring in the hodograph plane at the stage of transition from the subsonic 
to the supersonic region is quite similar to what has been already found in Part II in 
the case of the more simple fundamental equation (7.6) of the mixed type. 

vastly, we shall derive from the preceding solution ¥,_,/2) another solution which 
will have a branch-point of order 1/2 at a certain point in the hodograph plane. 

Now, in general, the form of our fundamental equation (14.2) suggests that any 
new solution can be obtained by differentiating or integrating one of the known solutions 
with respect to 6, and as inferred from what has been pointed out in Part II, the order 
of singularity of the solution thus derived would differ by unity from the order of the 
original solution. Hence, the required solution, denoted by ¥,1,2) , can be derived from 
the preceding solution ¥,_,,2) given by (15.3) by integrating it with respect to 8, and 


we have 
4 dB / 1 dw / 
/ = — —————————————— = eee — eee — 1. — dy 
Vary | l — 7 cOSsw 1 — +r cosw dw/dg 7 ’ 


where use has been made of the relation dw/d8 = (1 — 7 cos w)' which can easily be 
obtained from (15.4). It is thus found that the variable w itself is a solution having the 
singularity of order 1/2. 

On the other hand, the variable 8 itself also becomes evidently a solution of the 
fundamental equation (14.2). Therefore, it will be seen from equation (15.4) that the 
function 7 sin w becomes also a solution having the singularity of order 1/2. Thus, 
denoting this solution by ¥,1/2) as before, we have 
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Vu) = TSiNw. (15.10) 


It can easily be proved that the singularity of these solutions y,,,2, show also the char- 
acteristic change as mentioned above, just as in the case of the preceding solution 
¥(-1/2) - 
As will be shown in later lines, we can discuss a uniform flow past an obstacle by 
making use of an appropriate linear combination of the above solutions y,_,/2) and W.1,2) « 

16. An alternative method of solving the fundamental equation (14.2). In the 
analysis developed in §14, we have used the integral representation of Bessel’s type for 
the Bessel function /J,(m7) in (14.6). However, if, instead of employing the integral 
representation of Bessel’s type, we make use of the integral representation of Poisson’s 
type for J,(n7), we can develop an alternative analysis similar to the preceding one, 
which enables us to calculate conveniently the limiting case of the incompressible fluid 


flow. 
Now, according to Poisson, the Bessel function J,,(n7) can be expressed in the form:° 
(n7/2)” “i : coal 
Jar) = Sooo exp {in7 cos 6} sin” 6 dé. 
(n7) = Ta 1/2)rd/y J, &P! 


If we insert this expression in the right-hand side of (14.6), we have 


rT @ n 
T . . . 2 
y= } B( exp {—78} exp {i7 cos 6} sin 0) dé, 
“0 n=0 Mh 

7 . 6 

where B,’s and yu are arbitrary constants. 
Thus, summing up the integrand into the form of an arbitrary function G(e’), as 
done in the previous case, we get the general solution of our fundamental equation 

(14.2) in the form: 


[ Ge) ao, 


with (16.1) 
log = i8 + ir cos 6 + 2 logsin 6, 
Mu 


< 
| 


II 


£(8) 


where the path of integration from 6 = 0 to @ = zm should be taken suitably in con- 
formity with remarks which will be given presently in the next section. 
We now consider a limiting case in which 
7— 0, 
i ° 
~ — £(finite). (16.2) 
Me 
u— 0, 


Such a limiting case corresponds evidently to the case of the incompressible fluid flow, 
and therefore, if we denote the corresponding limiting value of y by y;,. , we have 


px/2 
Vin. = 2] GW sin’ 0) dé, 
“0 
with (16.3) 
W = te, 
5G. N. Watson, A treatise on the theory of Bessel functions, Cambridge University Press, 1922, p. 48. 
‘Like the previous constant A, this constant » is a parameter determining the Mach number of the 


flow. 
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We assume that the stream function y;,.. for a known incompressible fluid flow is 
given by 
Vio. = g(W). (16.4) 
Then, substituting this in the left-hand side of (16.3), we have an integral equation for 
determining the arbitrary function G in the form: 


at, 


g(W) =2] G(Wsin’ 6) ao. (16.5) 


If we put 
W sin’? 6 = x, 


this equation reduces to the form: 
Ww : 
= I] G 
Jo YWW-—x vx 
This is nothing but an integral equation of Abel’s type and the solution is obtained by 
making use of the well-known formula as: 


lx. (16.6) 


a eo ae ee 
0s) = 3 vad | fea 


z — g'(W) } 
= - 0 / —_——-. gw. 
~ \ 90) + vx | Tay 


Thus, if we put this expression for G into the integrand in the formula (16.1), we 
shall obtain the stream function for the corresponding flow of a compressible fluid. In 
the following lines, we shall give a few examples. 

(a) As a first example, we consider the case in which 

r 7\-1/2 
Vine. = g(W) = (1 ~ W) . 
Putting this in the above formula (16.7), we readily have 


- 
veined s ote 


(16.7) 





Thus, inserting this expression for G in (16.1), we have ultimately 


aoe 
es T I ee a 
with (16.8) 
¢(é) = log 7 — i8 + ir cos 6 + 2 logsin @. 


(b) As a second example, we next consider the case in which 
Vine. = g(W) _ log (1 = W). 


Substituting this expression for g(W) into (16.7), we get 


—1/2 —1/2 
Gx) = - 2(2 _ 1) tan™ (2 — 1) 


x 
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Therefore, by (16.1), we obtain a solution of the form: 


y= - 2 / (ef — 1)7* tan™' (€* — 17” do, 
with (16.9) 
(0) 


II 


log 7 — i8 + ir cos 6 + 2 logsin 0. 


17. Remarks on the path of integration for the integral (16.1). From the above 
examples it will be seen that when any solution for the incompressible fluid flow reveals 
singularity at the point W = 1, the integrand of the corresponding solution of the 
integral form (16.1) for the compressible fluid flow has in general a singular point at 
6 = 0, where 4 is a root of the equation: 


¢(6) = log 7 — i8 + ir cos 6 + 2 logsin 6 = 0, (17.1) 


and the position of such a singular point 6 = 6, in the 6-plane varies with the values 
of 7 and @. 

Now, it is well known that in case a singular point of the integrand of any function 
defined by a definite integral moves across the path of integration with the variation 
of a variable, the said function loses in general its analytic continuity with regard to 
the variable. Therefore, in order that the solution (16.1) be capable of maintaining its 
analytic continuity, it is necessary that the path of integration should be such a curve 
connecting the two points @ = 0 and @ = z which can be deformed, with the variation 
of 7 and 8, without being cut across by any singular point of the integrand (Fig. 23). 





4¥-Plane 





Fic. 23. 


18. The singular point of the solution (16.1). Next, we shall discuss the singular point 
of the solution ¥ given by (16.1), which corresponds to the singular point W = 1 of 
the stream function y;,.. for the incompressible fluid flow. 

Now, it is found that Eq. (17.1) has in general two roots, and the corresponding 
two singular points of the integrand of y are usually situated in such a way that the 
one, denoted by P, lies on one side of the path of integration, while the other, denoted 
by Q, on the opposite side (Fig. 24). However, for a particular set of values of r, 8, which 
will be denoted here by 72 , 8» a8 before, the confluence of these two singular points 
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may occur. In this particular case, then, the path of integration must pass through the 
confluent singular points, and the solution y has a singularity. 

Thus, it is found that the singular point (7. , 8.) of the solution (16.1) is determined 
by the two equations ¢ = 0 and df/dé@ = 0, namely: 


log 7 — i8+ircos6+2 log sin 6 = 0, 
‘ (18.1) 


— irsin 6+ 2 cot 6 = 0. 


Eliminating @ from these equations, we obtain the equation for determining 7. , 


B. in the form: 


ees 9 wee 
Bo ~i F Vr2 —1+4+ 7 log (2 Fivr.- D} = 0, (18.2) 


and this equation corresponds to Eq. (15.6) in the preceding analysis. 








0 re Tv 
\ 
~Q 
9-Plane 
Fia. 24. 


After some calculations it can be shown however that this equation becomes in 
accord with the preceding equation (15.6), if we take the value of the parameter yu equal 
to 2e-*\. Therefore, further development of the analysis can be made along the same 
lines as in the case of Eq. (15.6), and we thus arrive at the conclusion that the singu- 
larity of the present solution (16.1) has also the same characteristic features as those 
of the preceding solution ¥,_;/2) which have been described in detail in §15. 

19. Tke solution giving the uniform flow past an obstacle. In the following lines, we 
shall discuss the flow past an obstacle by making use of the two fundamental solutions 
¥¢-1/2) and Y1/2) obtained in §15, namely: 


1 
Yi-1/2) = a (19.1) 
Yas = TSIN ow, (19.2) 
with 
B-tlogA\ + rsnw —w = 0. (19.3) 
We put 


o=r+is. 
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Then, inserting this in the left-hand side of (19.3) and separating the real and imaginary 
parts, we obtain the relations between 7, 8 and r, s in the forms: 


$+ log r 


TT = ’ 


cos r sinh s (19.4) 


r — (s + log A) tanr coth s. 


B 
In the first place, we shall examine a solution as expressed by the imaginary part 
of ¥-1/2) given by (19.1), namely: 
— (s + log d) tanr 
= I{y¥,_,,)} = = ; is: 19.5 
v Ye-172 {1 — (s + log A) coth s}~ + (s + log A) tan’ r ( ) 





Putting the denominator equal to zero, we have the equations for determining the co- 
ordinates 7. , 8. of the singular point of y in the forms: 


1 — (s. + log A) coths. = 0, 


| 
S 


(s. + log A) tanr. = 


8) | 


tanh s.. = s. + log \X, 


i. = 0. 
Thus, taking (19.4) into account, we obtain the result that 
To = sech & , B. = 0, 


and therefore it is seen that the solution y under consideration has a singularity at the 
point 7 = 7. = sechs. , 8 = 8B. = 0. 

Also, it will be shown without difficulty that y is always equal to zero along the axis 
8 = Ofrom7 = Otor = 7.2. 

Along any streamline y = const., we can in general express r as a function of s, with 
the aid of (19.5). Therefore, if we substitute the function r(s) so obtained into Eqs. 
(19.4), we can obtain the relation between 7 and 8 with s as the parameter. In other words, 
we can thus obtain the flow pattern in the 7, 6 plane, which is shown in Fig. 25. 

In the next place, we shall consider a solution which is obtained by taking the real 
part of ¥.1/2) as given by (19.2), namely: 

¥ = R{ Way} = rsinr coshs. (19.6) 


It can easily be shown that this solution possesses a branch-point of order 1/2 at the 
same point (r = 7. = sechs. , 8 = 8. = 0) as the preceding solution (19.5), and that 
¥ becomes always equal to zero along the axis 6 = 0 from r = 0 tor = 7... In this case, 
the flow pattern in the 7, 6 plane becomes as shown in Fig. 26. 

From these figures it is naturally expected that if we superpose the above two solu- 
tions (19.5) and (19.6) appropriately, we can obtain a solution which would represent 
the flow pattern in the hodograph plane (i.e., the 7, 8 plane) as shown in Fig. 27(a), 
and such a solution would give a required field of flow past an obstacle in the physical 
plane as shown in Fig. 27(b), as would be expected from the physical meaning of the 


hodograph plane. 
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The superposed solution is expressed in the following form: 


y= Nik sin w + -—— 1 \ 
1 — 7 cosw 














with (19.7) 
B—tlogv\ + rsinw —w = 0, 
where K is a certain constant to be determined appropriately. 
a! — 
“wf 
Fig. 25. ¢-1/2) Fic. 26. 1/2) 
a 
Zs 
“Lp, 
Nv 
q<C q>c ee 








(Bb 
A B __A ~q----- rc 
~~-4-_-p-------- 
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\ r 








Fig. 27. (a) Hodograph plane. Fia. 27. (b) Physical plane. 


If we determine the value of K in such a way that the leading edge of the obstacle 


coincides with the point r = 0 in the hodograph plane, we have 
— 13 
2 (19.8) 


K=17° 





1951] TWO-DIMENSIONAL TRANSONIC FLOWS 143 


In this case, it is easily found that the leading edge of the body becomes a stagnation 
point and that the upper and lower surfaces of the body meet at a finite angle (2+/a/x)r 
there. 
If, in the fundamental equations (13.1), we change the independent variables from 
w, 6 to 7, B by the aid of (13.5) and (14.1), we have 
ge = Va(r — 1/z)¥p, 
- (19.9) 
= Va TY, « 
Thus, putting the expression (19.7) for y into the right-hand sides of these equations 
and carrying out simple integrations, we obtain the corresponding velocity potential ¢ 
in the form: 


g= Var ~iK(; + cos a) + atte} (19.10) 


1 — r cOSw 


We shall next consider the transformation equations from the 7, 6 plane to the 
physical x, y plane. In general, we have 


dx = x, de + 2x, dy, 


dy=y,de+ y, dy. 


Inserting in the right-hand sides of the well-known expressions for x, , y, , Xy , Yy as 


given in Part I, namely: 


1 - 
Ll, =— cos 8, ia = —ee 
¢ q v q ’ 
1 = sine y = oe 
Yo q. ’ v pq ’ 


and carrying out integrations, we obtain the coordinates x, y on any streamline y = 
const. = y, in the physical plane in the following forms: 


r@ v1 
t= | (2 cos 0) dy — [ (4 sin a) dy, 
d q yoy: Jo pq =o 


Vi 1 
dg + [ (+ cos 0) 
Jo pq 


a 
y= [ (! sin a) 
“eo \9 v= 


where the point ¢ = ¢ , ¥ = 0 has been adjusted so as to correspond to the origin of 
the x, y plane. 

In particular, the coordinates on a particular streamline y = 0, a part of which 
coincides with the surface of the body, are calculated by the following formulas: 


[ (2 cos a) dy, 
— [io (Esin a) de. 


Po 


(19.11) 
dy, 


. e-¢ 


x 
(19.12) 
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Since Eqs. (19.7) and (19.10) give the relations between y, y and 1, 8 (and conse- 
quently, between ¢, y and gq, 6), all the integrands in the formulas (19.11) and (19.12) 
can be expressed as functions of g, y. Therefore, carrying out integrations, we can obtain 
the coordinates (x, y) on each of various streamlines and the flow pattern in the physical 
plane can thus be found. 

20. Numerical computations. By assuming y = 1.4 for air, detailed numerical com- 
putations have been carried out for three cases in which the Mach number M of the 








undisturbed uniform flow is equal to 0.717, 0.745 and 0.752 respectively, paying special 
attention to the state of affairs on the surface of the body, the corresponding values of 
the parameter \ being 0.542, 0.600 and 0.616 respectively. Here, M = 0.717 is the so- 
called critical Mach number at which the maximum local Mach number in the field 
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Fig. 29. 


of flow becomes equal to unity. Fig. 28 shows the streamlines y = 0 in the 7, 6 plane 
for these three cases. 
The coordinates (zx, y) of the point on the surface of the obstacle in the physical plane 
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have been calculated by the formulas (19.12). Denoting the chord length of the body 
by J, the values of x/l and y/l are shown in Table II. The profiles of the obstacles for 
the two cases in which M = 0.717 and M = 0.752 respectively are shown in Fig. 29. 
The profile for the case 1 = 0.745 has been omitted, however. 

It will be seen that there is a very satisfactory coincidence, especially in the forward 
part as well as in the vicinity of the trailing edge, between the profiles for the three 
cases, and therefore it may be assumed that the shape of the obstacle is fixed in spite 
of the variation of the Mach number M of the undisturbed flow. 

Values of the fluid velocity g on the surface of the obstacle have been calculated 


TABLE II 























M = 0.717 | M = 0.745 M = 0.752 
(A = 0.542 | (A = 0.600) (A = 0.616) 
y/l q ] x/l y/l q x/l y/l q 
0 0 0 0 0 0 | 0 0 0 
0.0299 0.0104 0.686 || 0.0323 | 0.0106 | 0.706 1 0.0338 | 0.0111 | 0.709 
0.0874 0.0236 0.793 || 0.0943 0.0246 0.821 || 0.0669 0.0191 0.7 
0.1399 | 0.0330 | 0.854 || 0.1506] 0.0346 | 0.881 || 0.1274 | 0.0310 | 0.862 
0.1897 0.0402 0.898 || 0.2040] 0.0420] 0.930 | 0.1837 | 0.0396 | 0.917 
0.2376 0.0455 0.927 0.2553 0.0475 0.967 || 0.2375 0.0460 0.961 
0.2843 0.0493 0.951 0.3051 0.0514 0.997 || 0.2894 | 0.0506 0.994 
0.3299 0.0518 0.972 || 0.3535 | 0.0539 | 1.023 || 0.3397 | 0.0537 | 1.022 
0.3747 0.053 0.989 | 0.4008 | 0.0552 | 1.043 || 0.3888 0.0556 1.048 
0. 4000 0.0533 0.997 || 0.4264) 0.0553 | 1.053 || 0.4367 | 0.0561 | 1.069 
0.4190 0.0532 0.999 || 0.4472 | 0.0552 | 1.062 || 0.4753 | 0.0555 | 1.083 
0.4632 0.0520 | 0.988 || 0.4930 | 0.0541 | 1.077 0.5217 | 0.0538 | 1.097 
0.5086 0.0493 0.947 | 0.5385 | 0.0517 | 1.075 || 0.5681 0.0506 1.067 
0.5563 0.0450 0.896 || 0.5621 | 0.0496 | 0.986 || 0.5730 | 0.0501 1.000 
0.6065 0.0395 0.853 0.6142 0.0432 0.893 || 0.5940 0.0475 0.938 
0.6590 0.0333 0.817 || 0.6704 | 0.0357 | 0.841 || 0.6502 | 0.0397 | 0.864 
0.7136 0.0267 0.785 || 0.7298 0.0275 0.800 || 0.7103 | 0.0311 0.815 
0.7704 | 0.0199 0.757 || 0.7919 | 0.0192 | 0.765 || 0.7736 | 0.0223 | 0.777 
0.8293 0.0132 | 0.729 || 0.8569} 0.0113} 0.733 0.8401 0.0137 | 0.742 
0.8904 0.0070 | 0.704 || 0.9246 | 0.0044 | 0.703 0.9098 | 0.0060 | 0.709 
0.9536 0.0020 0.682 || 0.9954 | 0.0001 | 0.678 || 0.9827 | 0.0005 | 0.681 
0.9861 0.0004 0.672 || 1.0000 0 0.675 || 1.0000 0 0.676 
1.0000 0 0.667 | | 
|} 

















and they are given in Table II, and the velocity distributions on the surface of the body 
are shown in Fig. 30. 

It will be seen clearly from Fig. 28 that when M = 0.752, the streamline y = 0 
becomes in contact with one of the characteristic curves of the fundamental equation 
(14.2) at some point P in the 7, 8 plane. In other words, the singularity J = 
a(q, 0) = 0 makes its first appearance at this Mach number at some point in 


O(2, y) 
the field of flow in the hodograph plane, and the velocity gradient becomes infinite at 
the corresponding singular point P on the surface of the body as shown in Fig. 30. We 
shall denote by 17, the Mach number at which infinite velocity gradient occurs on the 
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surface of the body and a trace of the so-called shock line (i.e., the singular point J = 0) 
first appears, and call it provisionally “the shock Mach number”’ for the sake of con- 
venience. Thus, for our obstacle we have M, = 0.752. 

From Fig. 30 it will readily be found that the curve of velocity distribution does 
not reveal any peculiarity even at the critical Mach number M = 0.717. But, it becomes 
rapidly steeper in the vicinity of the point P as the Mach number increases until, at 
the above-mentioned shock Mach number, it has an infinite gradient at the point P. 


12 | ] 
q x Liepmann’s 


observation 


e M=0.795 
x M=0.843 
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For still higher Mach number, two curves of singularity J = 0 grow up from the surface 
of the body. Then, the theoretical field of flow becomes many-valued in the neighbour- 
hood of such curves. At this stage, shock waves must form in the actual field of flow so 
as to avoid the appearance of the many-valued region which is expected theoretically. 

21. A comparison with observation. Quite recently, Hans W. Liepmann’ has meas- 
ured the pressure distributions on the surface of a biconvex circular arc profile placed 
in a high-speed wind tunnel. The dotted-line curve in Fig. 29 shows the biconvex profile 
of thickness ratio of 0.12 used by Liepmann in his observations, by adjusting it, for 
the sake of comparison, so as to have the same position of the leading edge as well as 
the same maximum thickness with those of our profiles derived theoretically, which 
have been shown by full-line curves. 

From Liepmann’s observed pressure distributions, we have calculated, by the use 
of Bernoulli’s theorem together with the isentropic law, the velocity distributions on 
the surface of his obstacle in two cases in which M = 0.795 and M = 0.843 respectively. 
The observed values of the velocity g thus found are shown in Fig. 30 by small black 
circles and crosses respectively. It may be remarked here that at the former Mach 
number M = 0.795, the first appearance of shock waves was observed. Taking account 


THans W. Liepmann, The interaction between boundary layer and shock waves in transonic flow, J. Aero. 
Sci., 13, 623-637 (1946). 
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of the discrepancy between the calculated profile and the biconvex profile used in Liep- 
mann’s observations, it may be said that the agreement between the theory and ob- 
servation is satisfactory. 

It seems worth noticing here that (a) the observed position of the main shock wave 
as appeared at an earlier stage falls within the many-valued region derived theoretically 
in the above and that (b) the main shock wave inclined obliquely forwards as observed 
by Liepmann, which is enveloping Mach waves starting from the inside of the field of 
flow but not from the surface of the body, should be just compared with the curve of 
singularity J = 0 found theoretically, which is as well an envelope of one family of 
Mach waves starting from the inside of the field of flow. 

Now, as will be seen from Fig. 30, the theoretical field of flow for the case in which 
M = 0.745, for example, is evidently partially supersonic in limited regions, but is 
still capable of being continuous and irrotational throughout the whole field of flow. 
Hence, we arrive at the affirmative positive answer to the so-called Taylor’s problem 
enquiring about whether there is any theoretical possibility of the existence of a con- 
tinuous irrotational flow of a compressible fluid past an obstacle such that it flows 
uniformly at a great distance from the body and at the same time contains limited 
supersonic regions in the neighbourhood of the obstacle; namely, the theoretical results 
of our analysis show that when a body is placed in a uniform stream of a compressible 
fluid moving at speeds less than that of sound, the local speed of flow can exceed that 
of sound in some limited regions in the vicinity of the body without violating the irrota- 
tionality as well as the analytical continuity of the flow. 

In this connection, it may be emphasized that the solution used here of our funda- 
mental equation (14.2) is expressed in a closed form but not in a form of infinite series 
and therefore it is quite free from the question of convergence. 

Lastly, it may be added that the theoretical results obtained in the above for the 
flow of our hypothetical gas seem to be still valid, not only qualitatively but also quanti- 
tatively, for the flow of the real gas subject to the exact isentropic law, because, in the 
flow treated above, the maximum speed of flow exceeds the local speed of sound by 
only about 10 per cent even at the so-called shock Mach number and consequently, our 
second hypothetical gas as employed in the present Part III can approximate very 
satisfactorily the real gas obeying the isentropic law throughout the whole field of 


flow, as is seen clearly from Fig. 21. 
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MINIMAL PROBLEMS IN AIRPLANE PERFORMANCE* 


BY 
BORIS GARFINKEL 
Ballistic Research Laboratory, Aberdeen Proving Ground 


Abstract. We develop here the theory of operating an airplane so as to minimize an 
arbitrary function of the end-values of the generalized coordinates. A propeller-driven 
airplane is treated as a particle in equilibrium, subject to the forces of drag, lift, thrust, 
and gravity. We assume that the specific fuel consumption is a function of the power 
only, and that the available power is independent of the altitude. 

The problem is shown to be of the Bolza type in the Calculus of Variations, with the 
complications arising from the presence of inequalities, discontinuities, and variables 
whose derivatives do not enter the problem explicitly. The Euler-Lagrange equations are 
derived and discussed. 

Notation. A subscript will sometimes denote an index, at other times the argument of 
partial differentiation. A superscript dot will indicate differentiation with respect to the 
parameter ¢. The Summation Convention will be observed. In referring to equations 
decimals may be used; e.g. (59.4) is the fourth equation of the set (59). 6;; is the Kronecker 
delta. 

1. Introduction. In the absence of lateral wind we shall treat the airplane as a point, 
P, in a four-space, specified by the coordinates (7, X, Y, m). Here T' is the time, X the 
length of are of a great circle of the earth, Y the altitude; and m the mass of the airplane. 
The end-conditions prescribe the point of departure, P, , as T, = 0, X, = 0, Y, = 0, 
m, = m (0), and some, but not all, of the coordinates of the destination, P, . We seek to 
minimize some prescribed function, G, of the remaining coordinates of P, . The following 
types of problem are of obvious practical significance: 


1) G = T, , minimizing the time of flight, 


2) G = —X, , maximizing the range, 

3) G = —Y, , maximizing the altitude, 

4) G = —m, , minimizing the fuel consumption, 

5) G = —X.(ms — Main), Maximizing the “transport”, 
6) G = —(a + m,)/T, , maxmizing the “profit”. 


Regardless of the nature of G, and the end-conditions, all problems lead to the same 
set of the Euler-Lagrange equations. We shall derive the latter from the physical laws 
governing the motion of an airplane. 

2. The physics of the problem. The dynamical laws governing the steady motion of 


an airplane are 


7= D+ mgsin¢, 
(1) 
L = mg cos ¢ 


where 7, D, L are the thrust, the drag, and the lift, respectively, m the mass, g the accelera- 


*Received Oct. 31, 1949. 
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tion of gravity, and ¢ the inclination of the trajectory to the horizon. The drag coefficient, 
Cp and the lift coefficient, C;, , are defined by 
D = 1C)Spv’, 
(2) 
L= 1C,Spv" 
S being the “characteristic” area, p the density of the air, and v the air speed. Cp and C, 
are connected by a parabolic relation’ 
Cp = A+Ci/B, (3) 
A and B being a pair of constants. 
The power delivered by the propeller is 
Nell = 7, (4) 


where N is the number of engines, ¢ the “propulsive” efficiency, and II the power devel- 
oped per engine. ¢ is assumed to be an empirical function of the relative air density, a, 





e€ = ¢(c). (5) 
The specific fuel consumption, C, defined by 
dm_ i» 
aT = NCTl, (6) 


will be assumed to be an empirical function of II only; 

C = C(I). (7) 
We may impose the requirement that the lean fuel mixture be used if II < II*, and the 
rich mixture if II > II*. At the transition point, II*, the function C(II), generally, has a 


discontinuity. 
The distribution of air density will be assumed to obey the exponential law 


o = p/p =e”, (8) 


6 being a constant. The effect of wind and the effect of cowl-flaps have been treated in a 
separate paper, and will not be considered here. A zero wind implies 


ax = vp cos¢. (9) 


The variables Y and II are bounded by the inequalities: 
Y 2 0, Tite < II < | ° (10) 


The following simplifying assumptions are made: 

1) The use of a supercharger makes the range of available power independent of the 
mode of operation. 

2) The trajectory has a gentle slope: cos ¢ = 1. 


1R. von Mises, Theory of Flight, first edition, 142. 
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3. Choice of variables. We define dimensionless quantities w, 0, 7, c, Z: 
—w = log (m/m), 6 = log [C,/(AB)'””], 
(11) 
x = II/Il,, c=C/C., E = e/a . 
Here mz, is the initial mass, @ is related to the angle of attack, ¢ is the value of eat Y = 0, 
and (II, , Co) is a pair of values satisfying (7). If C has a single minimum, as is generally 
the case, we take 

Cy = Cute ° (12) 
It is convenient to use the logarithmic variables £, 7, and the derived variables 7’, go 


defined by 


£ = logz, n = loger, 
(13) 

n’ = dn/dt&, (1 — 29)n’ = 1. 

We note that at r = 1, or = 0 
c= 1, dce/dx = 0, "c/dx”® > 0, 
nf = 1, oo 0 

and 

n’ > 0, Jo < 1/2 for all £. (14) 


The latter follows from the experimental fact that | dm/dT | in (6) is an increasing func- 


tion of z. 
Dimensionless t, x, y are defined by introducing the scale factors 8 , 8; , Bz : 


T = Bot, X = 6,2, Y= Boy. (15) 
It is convenient to choose 
__ Mog 
bo = Nello °’ 
1 Ag 
a, - 1(2)"s, (16) 
B, = B. 


Two dimensionless parameters represent the aerodynamic characteristics and the 
engine characteristics of the airplane: 
a= gC B/€o 

(17) 


h (2mog)**(‘Spo) 7 A™*B-** Nello. 


Now the system of 11 equations (1-9) in 13 variables X, Y, m, v, Il, C, €, ¢, tr, D, L, 
Cp , C, reduces to a system of three equations, 
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Exe*® = x cosh 6 + y, 


ieee (18) 


= = 
w = acre”, 
where E(y) ~ 1, and can, generally, be represented as 
E =e", x == const. < I. (19) 
In the logarithmic form (18) is equivalent, in view of (13), (19), to 


¢, = 3(—w + y — 0) — log x + const. = 0, 


d, = —w — 7 + logw + const. = 0, (20) 
¢, = log (x cosh 6+ y) + n —E — xy — logw = 0, 

n(é) being a known function. Thus the five generalized coordinates, y; , where 
y, = 2, Y= Y; Ys = w, yi = 8, Ys = &, (21) 


specify the state of the airplane in terms of its position, mass, the angle of attack, and the 
power as functions of the time, y, = ¢t. Three non-holonomic equations of constraint, (20), 
leave us with two degrees of freedom. These can be realized physically by an arbitrary 
choice of 6, controlled by the “elevator’’, and an arbitrary choice of £, controlled by the 
throttle. Among the transfinite set of the pairs of functions (6(t), &(t)) we seek that pair 
which minimizes a prescribed function G(t, , x2 , Y2 , w2), Subject to prescribed end-condi- 
tions, and satisfying the differential equations (20). 

4. Variational approach. We identify our problem with the Problem of Bolza’ in the 
Calculus of Variations in the special form: ‘Required the are y;(t) satisfying the equations 


dalt; y; Yi) = Q; g= 1, +++ ,n; B72 1,'*+* , mon, (22) 
and the end-conditions 
®,(t ; y;(to)) son 0; o> l, wiadaetet < m, 
(23) 


é, = const., y;(t,;) = const., 
and minimizing the function G(t, , y;(t.))”’. 
In later sections we shall consider the effect of inequalities of the form y(t, y;) > 0, 
and discontinuities of ¢, with respect to y; . 
The solution of the problem is obtained by introducing a set of variable Lagrangian 
multipliers, \,(t), a set of constant multipliers, 1, , and constructing the auxiliary func- 
tions F, T, J as 


F=N¢bp, T=G+ ub, 


ats 


J=r+] Fd. 


St, 


°G. A. Bliss, Lectures in the Calculus of Variations, 189, e.f. 
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By differentiation we obtain 


dJ — (tT, + F = y;F,,) dt + rf. + F,;) dy;\, + [ (r,, a 4 p,,)oy dt, (25) 


since P, is fixed, and 6y = dy — y dt. Our problem is equivalent to that of minimizing J; 
i.e. satisfying 


dJ = 0, d’J > 0. (26) 
Thus (25) splits into n Euler-Lagrange equations,’ 
F,, = fF, (27) 
and m + 1 Transversality Conditions at t, , 
r,, + p; = 9, 7=@, 1, -*> ,m, 28) 
where we had set 
p, = F,,, ym |, +++ mM, 
(29) 


p=F—-yF,,, yw=t. 


We shall further assume that the end-conditions at P, separate the variables y; ; i.e. 


&, = y, — const. = 0, a=I1,-*-,rim (30) 

y assuming r values in the range 0 < r < m. Then y, exist, and can be eliminated from 
(28), yielding m + 1 — r equations 

G, +p =90, FY, (31) 


involving only the “‘free’’ variables; i.e., such y; as do not enter the end-conditions at ¢, . 
The n + m unknowns (y; , Ags) are determined by the system of differential equations 


Fy, = 2 Py, 1=I1,- ,n = 5, 

(32) 

o3 = 0, B=1,°++,m= 3, 
whose order is generally 2n + m. In our problem, however, the order is depressed by three 
circumstances: 1) y; in @g are soluble for y; , 2) there are n — m equations F,, = 0, 
k =m+1,---,n, of (32.1), for the y; whose derivatives are absent in F, 3) \,, can be 
eliminated, since F is a linear homogeneous function of Ag . The resulting order is 2m — 1, 
so that 2m — 1 initial constants must be furnished. m such constants are y;(0) = 0; 
m — 1 additional constants may be chosen as gg(0), 8 = 1, +--+ , m — 1, where we define 
9s = Ap/QWwm - (33) 


These parameters. gs(0), of our family of extremals can be determined from the condi- 
tions of the particular problem. For, there exist r end-conditions, (30), andm +1—r 
Transversality Conditions, (31), connecting the m + 1 quantities t, , \w(t2), gs(0). 


si bide mM, 202, ef. 
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5. Euler-Lagrange equations. An extremal, generally, is compounded of ares lying in 
the interior of the admissible region y > 0, and of ares lying in the boundary y = 0. In 
view of the inequalities (10), prescribed in our problem, we must distinguish arcs of the 
following types: 

A) general case, x ¥ const., y 4 0; 

B) flight under constant ae r, r = const., y ¥ 0; 

C) level flight,  # const., y = 0. 

These arcs are joined together in accord with the Corner-Condition, discussed in 
section 8. 

In order to take into account inequalities y > 0 we augment F so as to include the 


constraints y = 0, by writing 


E — ake 5] 
os = od; = bY; 

Baas = E (34) 
Asha _ 0, Ashs = 0, 


and construct F = Ass , 8 = 1, «+ , 5, with the aid of (20), (33): 


F = {(9. + 9s — Hy — (gi + 2G2)0 — gi(0 + 2 log x) + 


(35) 
- log (x cosh 6 + y) + (1 — 29.)(n — logw) — (1 ¥ 29,)&} 
The Euler-Lagrange equation in @ is 
g(x cosh 6 + y) = zsinh 8, (36) 
which, in vew of (18), is equivalent to 
p = be BP og, = Be sinh 9 (37) 


S) 


p(w, y) being defined by (37.1). The Euler-Lagrange equations in z, y, w, can be simpli- 
fied by eliminating from them 2, y, » with the aid of (18), and then making use of (37) 


and (13). On introducing the constant 


bolt 


a = } log3 = coth™ 2, (38) 
the final result can be written as 


A; sinh (6 — a)/Eme* = const., 


£ log (\;/E7e*) 


E 
, Ps (9: + 9s — &)y 


(39) 
As( + 292), 


II 


f f,(1 — 292) /acre”] 


Il 
= 


n'(Go — G2) + 9s 





MINIMAL PROBLEMS IN AIRPLANE PERFORMANCE 155 





1951] 








Moreover, since F does not contain ¢ explicitly, there exists the integral F — y,F,, = 
const.; 1.e., 
























A3 (1 — Jo) = const., (40) 


which we shall use in place of (39.3). 
The Lagrange multipliers can be eliminated from (39) as follows. First \, is eliminated 
among (40), (39.1, 2); next g, , gs are eliminated with the aid of (34). Then (39) becomes 


Jo = g: + K sinh (6 — a)/Erne’*, 


(92 — go)(x — const.) = 0, (41) 


d r E 
42 log sinh (@ — a) + on (a — o} = 0, 
K being a constant. Finally, g, and g,. are eliminated from (41.1,2), with the result 
(gor — pe’ sinh @ — K sinh (6 — a)/Ee*)(x — const.) = 0. (42) 


Equations (18) can be written, in view of (37), as 


dt 1 


dw acre” ’ 


Gee” 9/2 
dx _ Epe (43) 


Fa ’ 


dw acr 


7 -6/2 
D a F(t — © — cosh a). 
dw ac T 


6. Computational procedure. The airplane is represented by three constants a, b, x 
and the function c(7). These automatically define the functions 


c= c(x), Jo = go(m), E= Ey), (44) 


in view of (13), (19). We recall that g, , p are known functions of y; : 


p= be @**””? /E(y), nu = << sinh 6. (37) 
Thus (41.3), (42), (43) is a definitive system of five equations, and determines f, x, y, 0, x 
as functions of w, provided five initial constants are furnished. Three such constants are 
t(0) = 0, x(0) = 0, y(0) = 0, prescribed by the end-conditions; two additional constants 
may be chosen as 6(0) and K, leading to a two-parameter family of extremals 


y;, = y(t; 000), K); t=1,+++,5 


Since t, x do not enter the system explicitly, the corresponding equations (43.1, 2) can 
be split off, and done by quadrature after the solution for y, 6, r has been obtained. 

Thus in the general case, A,  # const., y # 0, our system of equations consists of 
(37), (44), and 
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dy E pe” . 

—~=-—j{1-— cosh 6], 

dw ac T 

d : K . 
— log sinh (6 — a) + —(g, — x) = 0, (45) 
dw ac 


gor = pe’? sinh 6 + K sinh (6 — a)/Ee® 


A choice of 6(0) and A determines in succession (0) and g,(0) from (45.3) and (37), so 
that the integration may proceed. 
In case B (45.3) is replaced by x = const., and K can be discarded. In case C, y = 0, 


we have g,; = tanh @, and 
p = be **”, 
—6/2 > 
© = pe’ cosh 6, (46) 


gor = pe’ sinh 6 + Ke™® sinh (6 — a). 


Here a choice of K determines @ and x when y and w are known. In both the special cases, 
therefore, the number of initial parameters is reduced to one. 

The parameters 6(0), A for a particular problem can be determined from the end- 
conditions and the Transversality Condition at ¢t, . In order to make use of the latter it is 
necessary to consider first the Sufficiency Condition and the Corner Condition. 

7. Sufficiency condition. In the notation of Bliss the sufficiency conditions for a weak 
relative minimum are I, III’, IV’. Condition I is met by the solution of the Euler- 
Lagrange equations. The strengthened Legendre-Clebsch Condition,* III’, is equivalent to 


F 2, 52 62, > 0, (47) 


where z = (y¥;,¥%);7 =1,°°:,mk=m-+1, . the set of the highest deriva- 


tives entering F, and 6z satisfy the differentiate ry equations of constraint, 
ds:, 52; = 0, B= 1, +++. mem 1, «oo 0 me. (48) 


Applying (47) and (48) to (35) and (20), respectively, and eliminating éy; , we are led to 


the requirement that 


¥ — (1 + 3c) 867 + Qn! i (gon) se} > 0 (49) 
be a positive-definite quadratic form. Since p, 7, 7’ are positive, we deduce 
ds > 0, ] 
f (60) 
S (Gn) > 0} 
(50.1) is equivalent to 
Aa(te) > O (51) 


‘“ihidem, 235 





1951 MINIMAL PROBLEMS IN AIRPLANE PERFORMANCE 157 


For, y,; and y; are bounded by physical considerations, and g, , g. are bounded in view of 
(37) and (41.1). Hence, (39.3) implies that \, cannot change its sign. 





(50.2) restricts the range of r. Generally, g.7 has a single minimum at some value 


7 <1. Then we construct a new lower bound, z/,;, as 
, " = 
Tmin — Max (Tin ’ 7). (52) 
As a numerical example; let us take 


c= 4/(r+ 138-7), if e<ae* = 1.5 (lean mixture), 


0.5(r + 1), if x>1.5 (rich mixture), (53) 


° 
Il 


go = x(x — 1)/(x + 8), eo. by + br 3)/@ +3" if «r < 145, 


i dr 
(54) 
ddo . . 
Jo = 1/21 + 2n), , = 1/2(1 + 2r) if x > 1.5, 
7 
and (50.2) becomes 
r+ 9r —6>0 

so that 7 = 0.638, and 7/;, = 0.638. 


Jacobi’s Condition,’ IV’ has received in our treatment only an empirical verification, 
in the fact that the family of extremals covers simply the admissible region of space. 

8. The Corner condition. Consider an extremal compounded of the branches £,, and 
Ey. joined at t = t,. If the junction point lies in the surface ¥(y,) = 0, then its coordinates 
satisfy 

v,, dy; = 0. (55) 
Now applying (25) to the two branches, respectively, and adding the results we obtain 


the additional term 


AJ, = —[(Ap,;) dy;|. , 7=0,1,---,m. (56) 
where we define the “jump” in any function, f, as 
Af = lim [f(t + 2) — f(t — 0] = fa — f- (57) 
If J is minimized it is necessary that AJ») > 0, subject to (55). This leads to AJ, = 0, and 
Ap; = wy; » 
(58) 
0 = UW, ’ 


u being a constant, and p, = 0. (58) can be regarded as an extension of the Weierstrass- 


Sibidem, 258. 
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4 4 °,° 6 . . . ° 
Erdman Corner Condition.” If we now impose the requirement Ay; = 0, dictated b 
| J} ? 
physical considerations, we shall have the system of equations 


Ap; = uy, ’ J _ 0, eS ES 


0= ny, , 
Ay; = 0, (59) 
do, = 0, B=1,°:+,m, 
F,, = 0, k=m+1,-*-,n. 


The last line gives the Euler-Lagrange equations in the variables y, . 
Suppose £,, lies in the admissible region y > 0, and EF). lies in the boundary y = 0. 
If F is of class C’ at t, , and satisfies the Legendre-Clebsh Condition, it can be shown that 
the only solution of (59) is the trivial solution 
Ay,=0, Ay, =0, Ay=0, w=O0. (60) 


This is the, so-called, Tangency Condition, which requires that all variables and first 
derivatives appearing explicitly in F be continuous at the junction. Moreover, for the 
branch £,. we have 


v,, by; 2 O, (61) 


which, in view of (25) and dJ > 0, leads to the Convexity Condition, 


os 
Fy — 5 Po + AY, = 0, 
(62) 
A(t) < 0. 
A(t) can be readily identified with the Lagrange multiplier of the constraint y = 0, 


entering the augmented function § = F + dy. The point at which \ = 0 may be termed 
the point of “‘inflection’’. 
In our problem there are two inequalities y > 0; (34) gives 


Vixz=d&, Yo = ds. 


Then (62.2) requires that A, < 0, A; < 0. Since 4, > O by (50), we obtain from (33) 


Now (39.4), in view of (14), yields 
Yo > G2 if _ Tmin ? 
Jo = J2 if Tmin x cs 2 Tmax » (63) 
Jo = G2 if T = T max , 





Sibidem, 12, 203. 
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while (39.2, 1) leads to 





2 (9, — x) + ad log sinh (@ — a) > 0 if y=0. (65) 
ac dw 


The latter determines the point of inflection, beyond which minimal level flight cannot 
proceed. 

Application of the Corner Condition to the discontinuity at § = &* will be made in 
the Appendix. 

9. Transversality condition. There are two inequalities affecting y; at 4: y > 0, 
Wmax — #& > 0. Thus the function G in (31) is to be augmented by writing 


G=G+t uP. ; a=1,2 
®, = Y; ®, = Wmax —~ W (66) 


My = 0, Ho(@max sd w) = 0. 


Then (31) becomes 


Il 
So 


G, — 2;K sinh (6 — a)/Eme* 


II 
S 


G. — d, sinh (6 — a)/Ene” sinh a 
(67) 
y(G, + r;/Ene*) = 0, 


IG. — ds(1 — 2g2)/acre* (max — w) = 0, 


upon elimination of y, , ue With the aid of (66). 

We shall now consider problems of the type G = +y, , r assuming any one of the 
values 0, --- m = 3. ThenG,, = +6,,; . Observing that a, a, w, 7, c, E are positive, we 
deduce from (67) and (51) the following Table: 


Transversality Condition at ¢, 


Free Variable IfG = y; IfG = —y; lf G,, = 0 
Y=t K(@ —a) >0 K(@ — a) <0 K(@ — a) =0 
Y, = 2 @>a @<a 6=a 
Yo = y = — y=0 
¥3 = w 92 <3 92 > 3 92 = 3 
Or W = max - 


In the last column the variable y; is ignored at ¢,; i.e., it is an argument neither of G nor 


of the end-conditions at t, . 
If the entire extremal is of the type B, r = const., then dt and dw in (25) are no longer 
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independent, but connected by dw = acre dt. As a result, the first and last rows of the 


Table coalesce into 


1 1 jaa . _ 
29 gi > 2. fi = 2 OTW = Wmax - 


f . 

¥3 = w (68) 
In applying the Table we make use of the following Remarks concerning minimal 

curves: 


1) ‘‘@ and g, have like signs” 


2) “K(@ — a) = 0 implies g, = g. , and conversely” 

3) “@ < aand g, > 3 implies y <9” 

4) “g. > 4 implies t = max” 

5) “g. decreases in passing through the value 3 provided g, > —1” 
6) ‘The quantities \; , 8 — a and g, — g, cannot change their signs” 


The proofs depend on (37), (41.1), (43) and (38), (64) and (14), (39.3), (39.1) and (41.1), 
respectively. In virtue of the Remarks 6 and 2 the first two lines of the Table hold for all 
t<c. 

We shall next illustrate the use of the Table by considering a few special cases. 

10. Special cases. 1) Problems not involving the time. 

Here G, = 0, so that g; = g. , and K(@ — a) = 0 for all ¢, there being but a single 


parameter, say 7(0). In the case A (45.3) becomes 
gor = pe’ sinh @. (69) 


In the case C, y = 0, (46) reduces to 


—(30+6)/2 
x = be ? cosh 4, 


(70) 


go = tanh 6. 


Here 

Jo= Hh = G2; (71) 
so that the number of parameters is reduced to zero; the family shrinks to a single curve, 
which can be calculated by quadratures. For, eliminating @ in (70) and (43) gives 


a 2/3 2/3 
ia log r*(1 + go)*(1 — go) + log b°* = Qr) + log b, 


(= b~* *r(1) + k, 


z=X(r) +k. 
+, X are functionals of c(w), and can be tabulated as functions of 7. k, , k, are determined 


from the initial conditions. 
Let us next consider the problem of maximizing the range, x, when ft, y, w are ignored. 


Then G = —z, and the Table gives 


6<a, K=9, —-9.=0 for all ¢, 
y = 0, @ = Wmax or G2 = 4 at t.. 
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The alternative g. = g, = } at ¢, must be ruled out if y(0) = 0. For, then g, = g, > } 
at t < t, by Remark 5, and hence y < 0 by Remark 3, so that the inequality y > 0 is 
violated. Thus our solution is the curve for which y = 0 when w = wax - 

2) Problems not involving the range. 

Here G, = 0, so that 6 = a for all t. Then (45) is to be replaced by 


dy _ Al _— 2| 
dw ac . 2(3) |’ 


(73) 


(t — Textr)(Gor — 3°°"p — K’'/Ee’) = 0, 


where the constant K’ = K sinh (@ — a) need not vanish. 
Consider the problem of the most economical climb to a given altitude. We set 
G = w, G, = G, = 0, and note that 6 = a, K’ = 0 for all ¢. Hence (73.2) becomes 


(Gor _ 37°») (4 — Tente) = 0. (74) 


The first factor has only one root, 7, , since z is a single-valued function of gor by (50.2). 
The solution of (74), subject to (64) is the point in the interval (ain , Tmax) nearest to m, . 
Having determined z, we proceed with the integration of (73.1), obtaining the unique 
solution of the problem. 

Similarly, the problem of the fastest climb is solved by setting G = t, G, = G, = 0, 
and noting that @ = a, K’ > O for allt, and g. = 3 at t, . Then from the Remarks 1, 5, 4 
and the equation (68) it follows that for all t we must have + = mex 5 9i < }- 

11. Summary. The Euler-Lagrange equations, derived in section 5, can be readily 
integrated by the procedure of section 6, leading to a two-parameter family of curves. 
Such a calculation was carried out, for a typical airplane, by the Differential Analyzer at 
the Ballistic Research Laboratories in 1948, using ten integrators. The complications 
arising from the presence of inequalities and a discontinuity were easily resolved by apply- 
ing the Corner Condition, discussed in sections 8, 12. 

The family of curves so constructed represents the totality of solutions of all possible 
problems under consideration. The process of selecting from the family the curve that 
solves a particular problem is carried out by scanning the end-conditions and the Trans- 
versality Condition at ¢, . The latter is discussed and illustrated in sections 9, 10. The 
Sufficiency Conditions for a weak relative minimum can be, generally, satisfied, as shown 


in section 7. 
APPENDIX 


12. Effect of discontinuities. Applying the results of section 8 we consider the case 
in which £,, lies in the region y > 0 where y = 0 is a surface of discontinuity of F with 
respect to the arguments of y. Two possibilities arise: 

a) Wis not a function of y, , so that y,, = 0, and (59.2) drops out. Then the system 
(59) reduces to n + 2m + 2 equations among the n + 2m + 2 unknowns (Ay; , Ay; , 
Ads , »), and may be expected to have a non-trivial solution at a given t = t. 

b) Wisa function of y, , so that (59.2) contributes u = 0, over-determining the system 
by one equation. Therefore, at t) there exists, generally, only the trivial solution (60). 
Then the extremal must enter the surface y = 0 by a continuous transition, and remain 


in Y = 0 until it reaches some point t = ¢¥ , where a non-trivial solution exists. At that 
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point there occurs a discontinuous transition, which could be termed “delayed refrac- 
tion’’, or “‘reflection’’, according as ¥ = 0 is crossed or not. 
In our problem a discontinuity occurs at the surface 


y= —£=0 


Since £ does not appear explicitly in F, we are dealing here with a transition of type b. 
The system of equations (59), in view of (75) and (35), reduces to 
Aé@ = A(gir) = A(gar) = A[(1 — 2g.)/c] = 0, 
(76) 
(92 ss Jo) (ar oe Textr) = 0. 


From the last three equations of (76) we deduce 


{(") —. (* ae) \ a — Texts) = 0, 


C. — C- (77) 
(cr) = (cr) - ; 





2(g2)+ = (7-) 


(gor), = (gom)_. 


which, in conjunction with (64), determines 7. , (g2). , (g2)- . These three values are 
invariant, being dependent only on the function c(7). 

As an example, let us suppose that in (53) the discontinuity at x* is approached from 
the “lean” branch of the curve. Then z_ = 2* = 1.5, c_ = 1.067, and the first factor of 


(77.1), equated to zero, is 


0.5(7 + 1) — 1.067 
x — 1.5 


whose root, + = 2.24, falls outside the admissible range of x. The second factor of (77.1) 


—05r=0, if w>1.5, 





1.5 


leads to two solutions: 


1) T. = Tmax = 2, Cy 1.5, (g2)+ = 0.232, (go)+ = 0.200, 


T+ = Tain = 0.5, Cy = 1.067, (92)+ = 0, (Jo) + = 0.077, 
the values of c and g, being obtained from (53) and (54). Since only the first one of these 
solutions satisfies (64), we take 
wr, = 2, (g2)4 = 0.232, (g2)_ = 0.309. 
After entering the surface r = 7* = 1.5 (lean), the extremal remains there until g, reaches 
the value (g.)_. At this point, which can be observed by means of (40.1) during the com- 
putation, a delayed refraction occurs, r jumping to 7, = 2. 
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HEAT TRANSFER BETWEEN SOLIDS AND GASSES 
UNDER NONLINEAR BOUNDARY CONDITIONS* 


BY 
W. ROBERT MANN 
University of North Carolina 


AND 
FRANTISEK WOLF 
University of California at Berkeley 


1. Introduction. In the theory of heat transfer between solids and gasses, it is com- 
monly assumed that the rate of heat exchange across a gas-solid interface is proportional 
to the difference between the temperature of the solid surface and that of the ambient 
gas. This assumption is known as Newton’s Law of Cooling and it gives rise to a boundary 
condition of the following general form 


aU 
where k is the thermal conductivity of the solid; 9U/dn is the thermal gradient at the 
surface evaluated from the interior in the direction of the outward normal; AU is the 
difference in temperature between the surface and the gas, considered positive when the 
solid is warmer than the gas; and f is the factor of proportionality, frequently referred to 
as the film transfer factor.** If f is a constant the above boundary condition is linear. 
At ordinary temperatures, where most of the heat transfer is due to conduction-convec- 
tion, f varies but slightly with temperature and it is not a bad approximation to regard 
it as a constant. At higher temperatures however, most of the heat is transferred by 
radiation and the film transfer factor varies greatly with temperature. Neglecting con- 
duction and convection, we find from the “fourth power law” that f, , the film transfer 


factor due to radiation, is given by 


T: — T: 
f, = Ae T.-T? (2) 


where 7’, is the absolute temperature of the solid surface, 7’, the absolute temperature 
of the ambient gas, « the emissivity, and A is a constant depending upon the units of 
measurement. Even when all the heat exchange is by conduction-convection the film 
transfer factor, f. , changes somewhat with temperature but the dependence is much 
more complicated than equation (2) and is expressed in the form of empirically de- 
termined relations between certain dimensionless moduli. f 

Henceforth we shall not be concerned with any particular form of the relationship 
between f and the temperature. The important point is that when the film transfer 
factor is a function of the temperature, the boundary condition (1) becomes nonlinear. 
It is our purpose in this paper to investigate a nonlinear boundary value problem under 

*Received Nov. 8, 1949. 

**L. M. K. Boelter, V. H. Cherry, H. A. Johnson, Heat transfer, University of California Press, 
pp. IIb-1, IIb-2. 

+Boelter, Cherry, and Johnson, op. cit., Chap. XII. 
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the most general physically significant relationship between the film transfer factor and 
the temperature. 

2. Statement of problem. In order to keep geometrical considerations as simple as 
possible we shall consider only the semi-infinite solid. To simplify further the non- 
essential aspects of the problem, we shall assume the gas to be maintained constantly 
at unit temperature. This will enable us to regard the film transfer factor as a function 
of the surface temperature alone. Requiring that the initial temperature of the solid be 
zero throughout, the problem of finding the ensuing temperature-time distribution, 


U(x, t), in the solid can be formulated as follows: 





U(x, ) = U,Az,), 2>0,t>0, (3) 
U(a, 0) = 0, (8a) 

soe — U(0 TU(O, b : : 
-~U.0, ) = LU nA )] _ evo, 9d), (3b) 
U(r, |<M>1 z>0,ti>0. (3c) 


As usual. the following functions will be assumed to be continuous for the values of x 


and ¢ indicated: 


U(a, b for 22 0,%t2> 0, (3d) 
U2, bt) for z>0,t> 0, (3e) 
U..(z, t), U.(2, for z>0,t>0. (3f) 


Equation (3) is the well-known heat flow equation, where the units of time and 
distance have been so chosen as to make the diffusivity 1. (3b) is the special form of 
boundary condition (1) corresponding to this particular problem. Observe that —kU,(0, t) 
is the rate of heat flow per unit area into the solid from the gas. The function G[U (0, 2)], 
which is proportional to this rate of heat exchange, occurs continually throughout the 
following work and will be referred to as the input function. Condition (3c) serves the 
double purpose of restricting the behavior of U(z, t) as « tends toward infinity, and of 
excluding the possibility of an instantaneous heat source at the surface when t = 0. 

To complete the statement of the problem, some hypotheses must be made con- 
cerning the input function, G[U(0, t)]. We know from experience that heat transfer 
takes place in a continuous manner; that a net exchange of heat takes place between 
two media only when they are at different temperatures; and that the net rate of heat 
transfer is a monotone increasing function of the difference in temperature between the 
two media. Referring to the definition of the input function above, we see that in any 
physically significant problem the following three hypotheses must hold: 

] is continuous for all U; 


A. G[U] 
B. G[U] is zero when U = 1; 
C. G[U] is a monotone decreasing function of U. 

In the following work we shall repeatedly invoke the above hypotheses, especially in 


Sec. 5. 


cT 


c 


1951] HEAT TRANSFER BETWEEN SOLIDS AND GASSES 165 


For a brief summary of the principal results obtained, the reader is referred to Sec. 6. 

3. Reduction of the problem to a nonlinear integral equation. The temperature dis- 
tribution in the solid, which is the unknown quantity in the problem stated in Sec. 2, 
is a function of the two variables, x and ¢t. We can easily show, however, by Duhamel’s 
Principle for example, [1] that U(2, t) is completely determined by the surface tempera- 
ture, U(0, t). This leads us to expect that the problem admits a more concise formulation 
in terms of the function of a single variable, U(0, t). We shall effect such a re-formulation 
by means of the Laplace transform. This well-known transformation will not, of course, 
eliminate the essentially nonlinear character of the problem but it does restate it in 
terms of a nonlinear integral equation for U(0, t). 

In view of the conditions imposed upon U(z, #t) in Sec. 2, it is easily verified that the 
Laplace transformation with respect to ¢ carries (3), (3b), and (3c) into the following 
linear ordinary boundary value problem: 


Uz.(2, 8) — su(x, 8) = 0, (3’) 
u(0, s) = —L{G[UO, ]} = —g), (3b’) 
M , 
| u(x, s) | < for «>0. (3¢’) 
The general solution of (3’) is 
gi/2 


\ —zai/2 z 
u(x, 8) = Ae~**’’” + Be*’’’," 


where A and B may depend on s but not on x. (3c’) requires that B = 0 and from (3b’) 
we find that A = g(s)s'””. Hence we get that 


u(z, 8) = a eos - 


for the solution of the transformed boundary value problem. Inverting (4) by means of 


the Borel formula [2] we obtain 
, _ f' Gv, x] —2° 
U(x, ) = J, et — 1)” exp 4(t — 7) dr (5) 





which expresses U(x, é) in terms of all the values of U(0, 7) between 0 and ¢. We can 
now prove the following theorem: 

THEOREM 1. A necessary condition that a function y(t) should be a surface temperature 
function for the problem stated in Part II is that it satisfy the following singular, nonlinear 
integral equation of the Volterra type 


‘ Gly(x)] 
y() = I AGE — pF dr (6) 
and that 
ly|<M t>0. (7) 


If | y(t) | < 1, then equation (6) is also a sufficient condition. The proof of the theorem is 
facilitated by the following lemma: 

Lema I. Any function, y(t), satisfying both (6) and (7) is continuous for t > 0 and 
y(O) = limyo+ y(t) = 0. 
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The proof of Lemma I is a straightforward exercise in elementary calculus and will not 
be included here. 

To establish the necessity condition stated in Theorem 1, we assume that y(t) is a 
surface temperature function for the problem under consideration. This means that 
there exists a function U(2, t) satisfying the conditions (3) through (3f) and having 
the property that y(t) = lim, U(x, t) = U(O, t). Applying the continuity condition 
(3d) to equation (5) we get directly that 

at 
_G[U(Q, | (0, 7] 
v0, » = | M 
0, 0 I, w(t — 7)” 
and since U(0, t) = y(t) we see that (6) must hold. (3c) and (3d) give us (7) immediately. 
This completes the proof of the necessity. 

Turning now to the proof of the sufficiency condition, we assume that y(t) satisfies 

(6) and (7) and show that the function defined as follows 
Cly(7)] —x 


U(z, t) = J, 7a — np? Pag— a dr 


3 dr, 


is a solution of the problem stated in Sec. 2. The proof consists in verifying that U(z, t) 
defined above satisfies the conditions (3) through (3f). Most of these verifications are 
trivial and will be omitted. We do include however the proof for condition (3b) which 
is not completely obvious, and that for (3c) which will make clear why the sufficiency 
is proved only for | y(t) | < 1. 
Considering first (3b) we differentiate (8) to get 
x Gly] —2x° 


r ( " De ees f i ALPS OD ‘xp ————— : 
U2, t) ni? if a — ye exP 77 a dr 


Since both this integral and that on the right in (8) converge uniformly in ¢ on any 
interval of the form 0 < ¢ < T for any x > 0, the above differentiation is valid. Making 
the substitution & = 2°/4(t — r) the above becomes 


Uz, ) = -25 | “s al u( = =) —§2 dé. 


Form now the following difference 


aa} [ Gly(t)Je** dé — [ al + - rr :) f fei ae|, 


2/2t*/2 


We wish to show that for any given fixed t > 0 one can find a 6 such that |z| < 6— 
A < e where «¢ is arbitrarily small. To this end we consider the following inequality 


A<A,+4,+4;, 


=| tlle ae) - amc 
=| [° {err - of oe - 2) fore ae, 


z/2t%/s 
/ Gly(dJe** dé . 


0 


where 


? 





ad 
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By the continuity of G and y (see Hypothesis I and Lemma I) we know that there exists 
a bound G, such that A, < 2G6. Hence, when @ = «/6G 

A, < ¢/3, forz < ¢«/3Gt”. 


Similarly, there exists a number 7 such that 





x | ( | | € 
we <9 Gly] — G| Wt — ge <3: 
On the interval @< t < o&, 
2 2 
x x 
rae < —— 
4° — 40° 
x” € 
Em ws & ows 
492 < 1% < 260 36" i 


By taking x < (€/3G)n'” we have 


e f° -# . 
a<ff e dE < 3- 


3 “0 
Turning now to A; we can write 
2/36 


' =g® r— —¢? € 
A; < [ Gly()le* dt < a e* db< 3° 


Hence, if we choose 6 to be the smaller of ¢/3G n'” and ¢/3G t'” we get 


zt<é6>A<e 
This implies that 


meino= ted add 2) 
lim U,(z, t) = lim - a dl y t 1 e* dt 


zn 


52 | Gty(oie® ae = 5A Gty(o) [oe at. 


T ) 


Therefore, 
U.(0, ) = —Gly(d] 


Considering now condition (3c) we recall from Hypotheses II and III that | y(#) | < 1 
implies Gly] > 0. Making use of the fact that G[y] does not change sign on the interval 
of integration, we can invoke the theorem of the mean for definite integrals [3] to get 


2 


1U Se eae ae —z ‘_ Gly] 
Pai i tn , nr *(t — 7)” oe 4(t — 7) = a» 4(t — 7) I w(t — 71)” on, 


“0 








where 0 < 7, < l. 


By (6) this becomes 
2 


U(x, t) = exp Frese <y) <1. 


The remaining steps in the proof of Theorem 1 are somewhat lengthy but not difficult 
and we shall leave them to the reader. 
The result of our work so far has been to show that any solution of equation (6) 
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which is bounded in absolute value by 1 will yield, through formula (8), a solution of 
the nonlinear boundary value problem stated in Part II. We shall see later that when 
the input function, G[y], satisfies a Lipschitz condition on the unit interval, the nonlinear 
integral equation (6) and the differential boundary value problem are completely 
equivalent, i.e. every solution of one will give a solution of the other. From now on we 
shall be concerned exclusively with equation (6) which we shall refer to as the surface 
temperature equation. 

4. The linear case. Before studying the surface temperature equation in its most 
general form, we shall consider briefly the important special case in which it is linear. 
This case arises when the film transfer factor is assumed to have a constant value, say 
fo. The input function G[y(é)], then becomes 


Het = h,[1 — y(t] 
and equation (6) reduces to 
* 1 — y(7) ; 
= Io my ee E ° (6* 
y(t) =h | TAG — 7)? dr i*) 


The above equation is easily solved by the Laplace transform and gives 


/ = ia 2 ig h*or 1/2 / 
y(t) = T(3/2) ho | € erfe (hr ’*) dr, (9) 


2. FF aw Sf" a 
where erfe(x) = 1 — erf(x) = 1 — =z e"° dv =-7 [ e” dx. 
ee ae ge 

In the more general case where (6) is nonlinear, the Laplace transform will no longer 
be useful, since the transform of the product of two functions cannot be expressed in 
terms of the transforms of the individual functions. For this reason, we solve (6*) by a 
different method which can be carried over to the nonlinear case. This second method 
is that of successive approximations. We begin with the function y,(¢) = 0 as the first 
approximation to y(t). We then construct the sequence of functions {y,(t)} defined by 
the recursive formula 











at sca y,(7) 
Yn+i(t) = No | PAG — 7)? dr. (10) 
Making use of the formula 
i ” dale T'(n ‘2 + 1) (n+1)/2 
— wdr=— , ( 
| [r(t — 7)]'” Gr I'(n/2 + 3/2) ; . a) 
we obtain 
— ho 1/2 
= 
y2(t) — T'(3/2) (2) t, 
oe ho 1/2 hs hs 3/2 
WO = Tey! — Ta)! t+TE6/H° 


etc. 
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In this way we are led to the series 


ro) 


k+l hs k/2 
y(t) = 2 ( 1) T(k/2 + Tih (12) 

which obviously converges uniformly on any interval of the form 0 < ¢ < T. We can 
therefore verify that y(é) as given by (12) satisfies (6*) by substituting it into that 
equation and interchanging the order of summation and integration. 

We shall see in Part V that the above well-known procedure, with a slight modifica- 
tion, can be successfully applied to the nonlinear surface temperature equation. 

5. The nonlinear integral equation. In this section we shall study the following sin- 
gular, nonlinear, integral equation of the Volterra type 


y(t) = I Se dr. (6) 


We showed in Part III that it must be satisfied by the surface temperature, U(0, t), of 
the problem stated in Part II, and we therefore refer to it as the surface temperature 
equation. If we can find a solution of (6) which is bounded in absolute value by 1, then 
by formula (8) we can construct a solution, U(z, t) for the problem stated in Part II. 
Furthermore, if (6) can be shown, under certain conditions on G, to have a unique 
solution and this solution is bounded in absolute value by 1, then it will follow that under 
these same conditions the original heat transfer problem has a unique solution. All our 
results will be obtained by exploiting Hypotheses A, B, and C of Part II and it is im- 
portant that the reader always keep these clearly in mind. 

First we shall prove an existence theorem stating that (6) always has at least one 
solution which is bounded in absolute value by 1. To this end we introduce the following 


functional transformation 








te = [ or haar, (13) 


7 





where 
G[z(t)] when 2(t) < 1, 
G*[2()] = 
0 when 2(é) > 1. 
Since G[1] = 0, G*[z] is continuous. Now pick T > 0 arbitrarily large and regard it as 
fixed. Henceforth we shall confine our attention to the interval 0 < ¢ < T. 
Consider the sequence of functions defined by the recursive formula 
Znai(t) = Slz,](d), where 2,(t) = 0. (14) 
We can evaluate z,(t) explicitly since G[z)(t)] has the constant value G[0] which we shall 
denote by G, . We get 
= _ G&G 1/2 
z,(t) —_ T'(3/2) t ’ 
z,(t) cannot be evaluated without knowing the input function, G[y], but the following 


“2 


lemma enables us to infer some important facts about the behavior of the sequence 
{z,(t)}. 
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Lemma II. If u(t) and v(t) are two functions which are continuous for t > 0, which satisfy 
the inequality u(t) > v(t) for allt > 0, and if u(O) < 1, then 


S[uj(t) < 3[v](d), for all t> 0. 


The above lemma is a very simple consequence of the monotone decreasing nature of 
G[y] stated in Hypothesis C. Since z,(0) = z,(0) = 0 and since z,(t) > z(t) for allé > 0, 
it follows from Lemma II by the method of induction that the following inequalities 


hold for all ¢ > 0. 


z,(t) > z(t) > ee Zon+1(t) ee ne (15) 
Zo(f) < 2,(8f) < -** < 2q,(t) <-> (16) 

and also 
Zonsi(t) > Zo,(), (17) 


where 7 and k are independent positive integers or zero. Since z(t) = 0, it follows from 
the above inequalities that for all n > 0, z,(t) > 0 for all ¢ > 0. The first few of these 
functions are sketched in the figure below. 


Z ,(t) 
Z3(t) 





Z a(t) 
Zo(t) 

















0 Zolt) t 
Fia. 1. 


All of these functions are bounded from below by z(t) and from above by z,(?). 
Therefore, on the arbitrary fixed interval [0, 7], they are equibounded. We shall find 
shortly that the {z,(é)} are also equicontinuous. From this it follows by Arzela’s theorem 
that the closure of the set {z,(¢)} in the usual topology is compact. It is easy to see, 
further, that the {z.,..,(¢)} converge uniformly from above to some function, say u(t), 
and the {z,,(é)} converge uniformly from below to a function, v(t). If u(t) = v(t), then 
we have a fixpoint of the transformation (13). If u(t) and v(t) are not the same function, 
then each will be a fixpoint of the square of the transformation (13) but neither will be 
a fixpoint of the transformation itself. 

In order to prove that the transformation (13) does have a fixpoint, we enlarge our 
attention to the collection of all functions which are continuous on the arbitrarily large 
fixed interval [0, 7]. We consider these functions as elements of a Banach space, @, by 
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introducing the usual topology, i.e. we denote the norm of an element f by || f || and 
define it as follows 


| f |] = lu.b. | f( | on [0, 7]. 


By the distance between two elements f and g we mean the norm of their difference 
lf - , and by convergence a € we mean uniform convergence. It is easily verified 
ths ut ed ‘se definitions do make @ a Banach space, i.e., a normed linear complete space. 
In this space we consider the set Z consisting of all members of @ which are non- 
negative, bounded above by z,(t) and which have a modulus of continuity given by 


| 2(t;) — 2(t) | < < (, — 4)”. 


It is easy to show that Z is convex and compact. 

We shall now show that the transformation carries H into itself. From Lemma II 
it is easily seen that the transform under (13) of any member of @ which is contained 
between z,(f) and z,(?) must also lie within these bounds. Moreover, assuming t, > 4, 
and denoting 3[z](t) by u(t), we shall have 


("Geol ar - in GeO ar | 


ge w(t Ae 7)” w(t 1) 





| u(t.) — u(t,) | = 


Fix G* T 2 -1/2 G* 
<| [SN - 9 - a - oar + | fo Oba 





2G, 1/2 g 1/2 2G, s2 
” | <3 (8? — (b — t)'? — 47) | + Sa (bh — t)” 
T T 
since G*[z()] < G{0] for all t > 0, and G[0] = G, . From here we find that 


| u(te) — w(t) | < 488 (te — 4)”. 


This means that u(#) has the same modulus of continuity as that prescribed for the 
members of Z. In other words, we have shown that 3 carries H into a subset of itself, 
namely Z. Moreover, it is easily seen that 3 is continuous on H, since if u and v are any 


two members of H, then 


| 3[u}(t) — 3fo](é) | = | [ le a = Fila) 4 dr | 


‘ Glu*(x)] — Glo*(7)] 5 | 
I w(t — 7)” ° 





u(t) when u(t) < 1 
where u*(t) = (18) 
l when u(t) > 1 


From this and the continuity of G, it is very simple to show that 3 is continuous on H. 
Summarizing the above results, we have that 3 is a functional transformation which 
carries a convex, compact set H of the Banach space @ into itself continuously. Using 
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Schauder’s generalization from Euclidian space to Banach space of the Brouwer fixpoint 
theorem, [5] we can infer that 3 has a fixpoint in H. This result will be stated in the fol- 
lowing theorem. 
THEOREM 2. On any interval of the form [0, T] there is at least one continuous function, 
y(t), such that 0 < y(t) < (Got'””/T(3/2)) for all t > 0 and such that y(t) = S[y](2). 

In order to prove the existence of a solution of equation (6), we must prove the 
following theorem. 


THEOREM 3. If y(t) ts a continuous fixpoint of the transformation (18) for 0 < t < T, 
then y(t) < lforO <i < T. 


To prove this, we shall assume that the theorem is false and arrive at a contradiction. 
Let S be the set of points on the open interval 0 < ¢ < T for which y(t) > 1. Since 
y(t) is continuous, S is open. In fact, S consists of disjoint open intervals. Now define 


y(t) when iz S, 
y*(t) = 
1 when te S. 


The continuity of y*(t) follows from that of y(t), and obviously 
Slyl() = Sly*](d). (19) 
Let ¢ be an arbitrary point of S. Then ¢ belongs to an open interval of the form 


4 <¢ <4, and 


Ir. 


S 
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Jo [x(t — 7)] Je, [x(t — 7) |? 


But since y*(#) = 1 and G[y*(t)] = 0 when t, < ¢ < #, , and since G*[y*] = G[y*] is never 
negative we can write 


ito = [° SOL a + [ Se, 


; rs Gly*(r) ‘  Gly* ‘* Gly* 
Syl) = | “y — dz + / an dr = [ sel dr 


1/2 
Jo © (t= Jo 





' ats Gly* te 
“Bylo < | syle “*. 


0 


This means that 3[y*](t) < 1 on S. But since 3[y*](t) = S[y](, we infer that y(¢) = 
3[y](t) < 1 on S. This is a contradiction unless S is empty and thus the theorem is 
proved. 

We now notice that for any function, f(t), which does not exceed 1, G[f()] = G*[f(d] 
and for continuous f 


'  G*lf@n ‘Gif 
a[f](d) = [ aah dr = [ —pfhal dr. 


e /0 





Since we have just shown that the transformation 3 always has at least one continuous 
fixpoint, y(t), which is bounded between 0 and 1, y(t) has the following property 





*  G*ly(r a. 
y(t) = Sly] = / sn dr = i on dr. (20) 


0 
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We have now obtained the following existence theorem. 


THEOREM 4. For any input function, Gly], there is, on any interval of the form0 < t < T, 
at least one continuous surface temperature function, y(t), satisfying equation (6) and having 
the property that0 < y(t) <1. 


3y virtue of Theorem 1, this means that the nonlinear differential boundary value 
problem stated in Part II always has a solution. 

Theorem 4 expresses the best result which we have been able to obtain with the 
most general input function, i.e. with the input function subject only to Hypotheses A, 
B, and C of Part II. After strengthening Hypothesis A, we shall proceed to give a method 
for obtaining a solution of the surface temperature equation; to prove the uniqueness of 
this solution; and to find other properties of the surface temperature. This procedure 
of deriving increasingly better results by progressively strengthening our hypotheses is 
of mathematical rather than of physical interest, since the strongest hypothesis which 
we shall ever use in place of A, namely that G is analytic, can safely be assumed in all 
heat transfer problems of physical significance. 


TueoreM 5. If Glu] satisfies a Lipschitz condition on the closed unit intervralO < u < 1, 
then the set of approximating functions, {z,(t)} defined by the recursive formula (14) converge 
to a solution of (6) for all t > 0 and the convergence is uniform on every finite interval. If 
we denote this solution by y(t , then Go < y( (t) < 1 for allt > 0. 


The proof is elementary, but since we shall need the same method, with slight variations, 
in following demonstrations, we shall go through it once here and merely refer to it later. 

In view of the fact that G*[u(é)] = G[u*()] where u*(é) is defined as in (18), and 
since 2*(t) is between 0 and 1 for all ¢ > O we can write 


_ f' L@tet@)] — let) | Let(s) — 28() | 
7 i [a(t — 7)]'” var <b [ [x(t — 7)]'” 





where L is the Lipschitz constant. Since | z¥(é) — z¥(t) | < 1 for all ¢ > 0, this becomes 


% Ir L 2 
tate = ft < tl ee we ate 
| Z(t i(t) i. [ | (1 pin "7 T'(3/2) t 


Similarly, 


* | Glzek(r)] — G 2* — 2 
|2s() — 2x0 | = J = a st HU a, <b [ La i Jae. 


It is easily seen that 
| 2#.1(0) ae 2 (0) | < | Ze+1(2) = 2,(t) |. 


Therefore, 














L 1/2 
* 
| 22 (t) — z2¥(d) | . T@/2 ! 
Substituting this into the above bound for | z;(¢) — z2(é) | gives 
L ‘ a L’? (3/2) L’ 
| 2a) — 2240 te lo 4 = F872) 12) '= TE)” 
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Continuing in this way with the help of formula (11) we get 
i" 
— f. 
I'(n + 1) 
This difference approaches 0 as n © for any value of ¢t. By the inequalities (15), (16), 
and (17) it is therefore clear that on any finite interval the functions {z,(¢)} converge 
uniformly toward a limit function, y(t). By virtue of the uniformity of convergence we 
can pass to the limit under the integral sign as n —- in the recursive formula (14) and 
in this way show that y(#) must be a fixpoint of the transformation 3. But we proved in 
Theorem 3 that any continuous fixpoint of 3 is bounded above by 1. This implies that 
y(t) satisfies (20) and is therefore a solution of (6). 

Extending the Lipschitz condition to hold beyond the unit interval we prove the 
uniqueness of the surface temperature function. 








| Zonei(é) oes Zon(t) | < 


THEoREM 6. If Glu] satisfies a Lipschitz condition on the interval [0, 1 + €] for any ¢ 
greater than zero, then equation (6) has a unique bounded solution. 


We already know from Theorem 4 that (6) has at least one solution, y(t) such that 
0 < y(t) < 1 for all ¢ > 0. Let u(t) be any other bounded solution of (6). We know by 
Lemma 1 that u(¢) is continuous and that u(0) = 0. Since the integrand of (6) is positive 
when u< 1, we also know that the origin is not a limit point of zeros of u(t). Suppose 
that u(t) has zeros to the right of the origin and let ¢, be the first one. Let ¢, be the first 
value of ¢ on the interval 0 < t < t, for which y(t) = 1 + e€/k for any positive integer k. 
Then for 0 < ¢ < ¢, we have 0 < u(t) < 1 + e€ and thus the Lipschitz condition will 
hold for G. Applying the method of successive approximations illustrated in the proof 
of Theorem 5 to the difference u(t,) — y(t,) we obtain 


' . (1 e)L” n/2 
| u(t) — Lt OL" for all n. 2 
| w(t,) — y(t.) | < T(n/2 ) ty for alln (21) 





Since this difference tends to zero as n approaches infinity, we see that u(t,) = y(t.) < 1 
which is a contradiction. Hence there is no value of ¢ on the interval 0 < t < ¢, for which 
u(t) exceeds 1. But here again is a contradiction because it is impossible that u(t.) = 0 
since by Hypotheses B and C there can have been no negative contribution to the in- 
tegral in (6). This means that u(t) remains between 0 and 1. Therefore we can use the 
method of successive approximations to show that (21) holds for all ¢ and hence u(t) = 


y(é). 
TuEorEM 7. If Glu] is analytic on the closed unit interval [0, 1], then equation (6) has a 
unique bounded solution, y(t), and y(t) is analytic for all t > 0. 


The uniqueness follows from the fact that since G is analytic on the closed unit interval, 
it can be analytically continued onto a somewhat larger interval [0, 1 + «]. G obviously 
satisfies a Lipschitz condition on this larger interval and by Theorem 6 we have the 
uniqueness of the solution. 

The proof of the analyticity, which is somewhat lengthy, divides itself into the 
following three steps. First we shall show how the solution, y(t), of (6) can be represented 
as the limit of a certain uniformly convergent sequence {wu,(t)}. Second, we shall show 
how y(t) can be represented as the limit of a uniformly convergent sequence, {w,(t)}, 
of analytic functions of the real variable, ¢. Third, we consider ¢ to be a complex variable 
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and show that a sequence, {W,(t)}, of analytic functions of the complex variable con- 
verge uniformly to a function, Y(¢), which coincides with y(t) along the real axis. 

We begin the first step by introducing a set of functions defined as follows: 


u(t) = 0, 


‘ _ Glti.(7)] 





‘ _ G,[u,(7)] 
—_ 7% = a eee CC = “si 7D 
Un+i(t) [Un] (0) Jo [w(t a r)]'”" T } § [a(t wae 7)|'” dr, (22) 
Glu, (0), when u,(t) <1 +e, 
G,[u,()] = 
G[l + «], when u,(t) >1+e, 
and where 
u,(1), when u,(t) Cl +e, 
u,(t) = . 
1, when u(t) >1+e. 
We see immediately that when « = 0, the sequence {w,(t)} reduces to the sequence 
{z,(t)} already introduced by the recursive formula (14). Also, u,(é) = 2,(t) = 


G,/1(3/2)t'” for any ¢, but u,(t), unlike z,(é), will eventually fall below the t-axis for 
any « > 0 because of the negative contribution to the integral in (22) when the argument 


u, (t) 


de 


u4(t) Uo(t) 


ee, 
ult) TH ~ t 


Fia. 2. 


u3(t) 




















of the input function is greater than 1. Let us denote by T, this zero of u(é). It isa 
simple matter to verify the intuitively obvious fact that T, is a continuous function of 
« which decreases from © to some positive number T’,(0) as ¢« increases from 0 to ©, 
Confining our attention as usual to a fixed arbitrarily large interval [0, 7] on the t-axis 
we fix ¢ at a constant value sufficiently small that T,(e) > T and that 1 + e lies within 
the interval of analyticity of G. 
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Recalling the monotone decreasing nature of G, and making use of the fact that u.(t) > 
u(t) for 0 < ¢ < T, we can easily verify the following inequalities for 0 < ¢ < T: 


Blt) > alt) > ult) > +++ > tect) > *** ; (23) 
Uo(t) < u(t) < us(t) < ++ < Ue, (t) < ee: (24) 

and 
tia.s (0) > tld), (25) 


where » and k are independent positive integers or zero. Using these inequalities and the 


method demonstrated in the proof of Theorem 5, we can prove that the sequence {w,,(¢) } 
converges uniformly on [0, 7] to a fixpoint, y(t), of the transformation 3,{u](¢) defined 


in (22). 
We must now show that y(t) is the solution of (6) on the interval under consideration. 


From the above inequalities we know that y(t) > 0 for 0 < ¢t < JT. Assume now that 
| , 


t, is the first value of t for which y(t) = 1 + e. Then on the closed interval [0, t,] G.[y(d] 


Gly(t)] and y(t) is therefore the solution of (6) on this subinterval. But this means that 
y(t) < 1 and contradicts the assumption that y(t,) = 1 + e«. Hence y(t) is the solution 


of (6) on the entire interval [0, 7’]. 
The functions {w,(¢)} are not analytic because of the truncating process employed 
in their definition. We shall now represent their limit function, y(é), as the limit of a 


uniformly convergent sequence of analytic functions. Choose N so large that 


n>N->0 <u(t) << 1+ 6/2 for ox ts T (26) 


and pick k so large that 2k > N. It is our purpose to interpose an analytic function, 
say w,(t), between u,(f) and u,42(t) on 0 < t < T. By inequalities (23), (24), and (25), 
and by reasoning similar to that on which Lemma 2 is based, it is easily seen that the 
successive transforms of w,(t) under 3, will lie between those of u.,(t) and w,,.(t). If we 
designate these successive transforms by {w,(t)} it is therefore clear that lim,.. w,(t) = 


y(t) uniformly on [0, 7]. Moreover, these functions will be analytic on the open interval 
0 < t < T, since none will be truncated in the recursive formula. From (22) and (26) 


we see that their recursive definition could be written as follows: 


G[w,(r) re 
Wn+i(t) = a(t — ee 7 dr. (27) 





In other words, each w,(t) is the fractional integral of order 1/2 of a function analytic 
fori > 0. 

The u,(t) themselves are analytic on the open interval from the origin up to the 
point ¢ = T[(3/2)(1 + ©)/G,]’ = 7. where u,(t) is truncated. On this interval [0, 7’,] 
the {u,(t)} have the same recursive definition as that for the {w,(t)} shown in (27) 
except that u(t) = 0. In the neighborhood 0 < t < T, ua,(t) and u,,2(t) can be ex- 


panded in powers of +/t as follows. 
Un (t) = at’? + at +aj,t? +--+ +a,07 +--+, 
Uoree(t) = b,t'? + bot + bat? + eee + bt? te eee, 


Suppose that a; = b; fori = 1, 2, 3, «++ , mand that dns: ~ Omer - 


bea | 
“I 
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Clearly },.4; > @ns:. Now form the following functions 


Uoe(t) — fn(t) Urxso(t) = Ure+2(t) — full) 


pomt 72 gomt 72 ’ 





U2,() = 


where 


fn(t) = \ # a,t'? = p i. 
i 1 


»(t) and U,,(t), such that U,,..(t) > U2,(t) forO < t < T. 
There will be a minimum distance, say 6, between the two functions on [0, 7]. Approxi- 
mate to the mean of the two functions by closer than 6/2 with a polynomial, P(t). Then 


This gives two functions, U», 


we can write 


Ux() < PO) < Unxxs.(d, Oo<sts Tf, 
rere tf" re ew 0<t<T, 


fr" fb + 710 < f""""R(D + £40 < O°" "Un AO + 140, <4 F. 


"=P "U,() + frlt) = ux) and fr" U (0) + S(O) = thease(O. 
Hence if we denote t'"*"*P(t) + f,,(t) by wo(t) we have 
Uay(t) < Welt) < tense(t) for e<i<T 


and w,(t) is analytic on this interval. Using w,(t) in the recursive formula (27) we get 
the desired sequence of analytic functions, {w,(t)} converging uniformly to y(t). 
Now consider ¢ to be a complex variable, t = u + iv. The functions {w,(é)} will 


sf 
We-r+tis 








a JOsrsliteé 
> r R: 
isi sS 











Fie. 3. 


then become complex functions which we shall denote by {W,(} where W, = r, + ts, . 
Since G is analytic on the closed interval [0, 1 + ¢] we know that it has an analytic 
continuation into a neighborhood of this interval in the complex W-plane. In particular 
we know that there is a closed domain, FR, of the form shown below on which G is analytic. 








178 W. ROBERT MANN AND FRANTISEK WOLF [Vol. IX, No. 2 


Since G is analytic on R which is closed, we know that there exists a number, D, such 
that 
WeR—|G[W]| < D, (28) 


and since W,(t) is a polynomial in powers of +/t, we know that it is analytic in any 
region of the complex ¢-plane which does not include the origin. Consider now a region 
A in the é-plane of the form shown below 


0 ae te a 


a: 
boi — = 
O<u<T 
é *}ivi<V 





teutiv 





Y 











Fia. 4. 


W,(t) can be regarded as a transformation which maps the region, A, conformally from 
the t-plane into the W-plane. Recalling (26) and the construction of wo(t), we see that 
W,(t) carries the segment [0, 7] into a subset of [0, 1 + e€/2].on the r-axis. From this 
and the continuity of W,(t), it follows that by taking V sufficiently small, say V < V,, 
W,(t) will map A into R, i.e. 
teA — W,(t)eR — | G[W.(d] | < D. 
It is now our purpose to show that by taking A sufficiently narrow, 
teA — W,,(beR, for all n. (29) 


Integrating along the path shown in Figure 4 we have 


W(t) = le ini G[W pra, sd [ _G[Wo(7)] _ te 


The real part of W,(é) can be estimated as follows 


R{Ww,(i} <]Js aL, 73 dr + | [ me Par 


1 ou n'? (¢ 





putiv 


dr 


2% +i 4 D | aAG, a -anaprye ify?” 
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Along the vertical strip, 7 = u + i and dr = idé 


dé s € D 1/2 
o-p7-—'tat Tem’ 





R{W,(b} < 1 +54 D [ mE 
Jo 


This shows that for all values of ¢t belonging to A, we have 





R{Wi)} <1+§ 5 + rr y,, 


g ris/2) [ “ 
; <| 2D = Ve, 


we insure that D/T'(3/2)V'” < ¢/2 and therefore R{W,(t)} < 1+ 
Turning now to the imaginary part, we observe that along the stretch [0, uw], 


By requiring that 




















vada (2v)'" 
(4 f —— ie Oe 
I{(t 7) P< I(u no r)” + v 4 ae 
y’ G{W.(7)] D gs 
, } Pee od 
bie 1S Qn)? J, (u — 9* + 0°7]'” ~+ T(3/2)” 
< Gow -—* {(u? + v*)'? + u} — Inv] + = p_ r 
_ r(3/2) ° 
Thus, for any ¢ belonging to A, the following inequality holds 
r £s\) G, GV” 2 72\1/2 
{Wi} < yA [In ((7? ++ V?)'7 + T} - ae as? 


Since the first term on the right-hand side of this inequality tends to zero with V, there 


exists a number, V; , such that 
V<V;—]|9{W,O} | < S. 


Therefore, by taking V to be smaller than the least of the three numbers V, , V2, V3, 
we define a region A, in the complex ¢-plane which is mapped by the function W,(é) 
into the region R in the complex W-plane. By induction it is easy to show that each 
one of the functions W,(¢) maps A into R. 

We have now shown that {W,(t)} is a sequence of functions which are analytic in 
A, equibounded in A, and which converges uniformly on the interval 0 < ¢ < T, which 
is an infinite subset of A, to the function y(t). By Vitali’s theorem we can conclude that 
{W,(t)} converges uniformly throughout A to a function, Y(é), which is the analytic 
continuation of y(t) into the complex ¢-plane. y(t) is therefore an analytic function of 
the real variable ¢ for 0 < t < T. This completes the proof of Theorem 7, since T was 


chosen arbitrarily large. 
TueoreM 8. If G[y] is analytic on the unit interval 0 < y < 1, then y(é) is monotone in- 
creasing for allt > 0. 
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By differentiating equation (6) we get 
A i 
Go 


(wt)'/”" 


uh om [ Elv@lv'(s) (30) 


y’(t) 5 dr - 
y l iw-oe™* 


Perhaps the easiest way of justifying the above result of differentiating the singular 
integral in the surface temperature equation with respect to the parameter ¢ is to observe 
that (6) can be written y(t) = G[y(t)]*1/(xt)'’” where the star indicates the convolution 
or faltung operation. Then, by a well-known theorem on the differentiation of the faltung 
under hypotheses which are satisfied by the above functions [6], formula (30) follows 
immediately. 

y(t) is a power series in t'”* and is analytic for t > 0, but whereas by Lemma 1 y(0) = 0, 
y'(t) approaches infinity like t'’’ as t approaches 0. If y’(t) ever becomes negative, 
there must be a first zero, say ¢, , to the right of which is an open interval, /, on which 
y’(t) is less than 0. Let ¢, be any point belonging to 7. Then we must have 


es ida G’ [y(7)] oF Go 7 
y(t) = | 7, wi 7) y’(r) dr + (rly)? = 0, (31) 
ss fo G4’ [y(r) ly’(7) rs G’[y(r) ly’(7) Go 
dee | rai-2 T J € (hh — 1)” silted (wt,)'”” 


The latter equation can be written as follows 


, pio (4, — 7\'” G’[y(7)] = 
vo = [ (Ra) have a 


1/2 


t v/f Y 
: G’ly(7) G, ti 
Je, © (4 — 7) (arly) t; 
In the first integral on the right, the factor [(t; — 7)/(t, — 7)]'’” decreases monotonically 
from ¢,"/t}’? to zero. By the theorem of the mean we know that there exists a number 
7, between 0 and ¢, such that 
\ > at Y at, Ys 2 Y 
tp — 7,\'" G’[y(7)] ne "* G4’ [y(7)]y’(7) ti G, 
z | aie 73 Y (7) dr + *. iz dr + 7 172° 
1/2 1/2 ¢ / / 2 
' "(bo — 7) 4 f G~ t,~ (alo) 
Since G’[y] is negative for all y, and since y’(t) has been assumed to be negative on the 
interval J, the second integral above must be positive. Comparing this last expression 


for y’(t) with equation (31) and noticing that 


1/2 1/2 
(4 eas | < to 
4 — 7 “as 
it is obvious that y’(t,) < 0. This contradiction shows that y’(t) cannot become negative, 
and since y(t) is analytic for t > 0, we know that y(t) must be monotone increasing. 


THEOREM 9. Let G, be the space of input functions G[u] which satisfy a Lipschitz condition 
with constant <L on an interval of the form 0 < u < 1+ «€ for any « > O. Let Y be the 
space of corresponding surface temperature functions, y(t). As a metric we take the norm 
of the difference. The equation (6) represents a continuous implicit functional transformation 


from G,, to Y. 


1951 HEAT TRANSFER BETWEEN SOLIDS AND GASSES 181 


By Theorem (6) we know that for each element G of G, there is a unique element y(t) 
of Y, and that 9 < y(t) < 1 for all t > O. Select arbitrarily any member G, of G, and 
let y,() be its corresponding element in Y. Let G, and y,(t) be any other corresponding 
pair of functions. The difference between the two surface temperature can be written 
as follows 


1/2 5 — 


at G, mn 7” 
vs) — who = [SLY = Gls gy 4. f° Gl — Gly gy, 

Jo re = 
If we specify that G, be chosen in such a way that | G,[u] — G,[u] | < 6for0 < u < 1, 
then a bound for the above difference is shown below: 


; Ys i yi(r) | [ 
(i) — - — a : 
Yo\l) Y\ <b [% —-s dr + ne iy 7)? dr (32) 
In view of the fact that | y(t) — y,(Q | < 1 for all t > 0, (82) becomes 
E es Ir ‘ dr L + 6 2 
(t) — y,(b L _—.. 172 [ ee ee me” 
nl) —WOI<L] pagan ts] agian = rep! 


Substituting this bound back into (32) we get 


e LL + 54) 
| Yo (2) y(t) r(2) t + r( oe /2) 





1/2 
Es 


By repeating this process of successive approximations n times we get 


n 
L 6 (n+1)/2 


6t'”” Lé L’5 
y(t) — y, (ft) < — « A ne 
, “ ~ T@/2) ' r(2) ‘ * r(5/2) * te r'((n + 3)/2) 





Lt + 8) nas 
° (n/2 + 2) 


Letting n tend toward infinity, the above can be written as follows 


; te Le, De 
Y. t) — y(t) ._ Lé ‘- + r3) 7 as + I'(5) * | 


i} 1, Dt, NE, | 
itis FE » + F6/H * 17/2 * 17/2 
Restricting ourselves to an arbitrarily large fixed interval [0, 7] we can now show that 
| y(t) — y(t) | < 67’? [1 + LT’ \e”"". 


For any given L and T the above difference can be made arbitrarily small by taking 6 
sufficiently small. This completes the proof of Theorem 9. 

Making use of the fact that any monotone continuous function, G[u], which satisfies 
a Lipschitz condition with constant L can be approximated arbitrarily closely on any 
closed finite interval by a monotone analytic function which also satisfies a Lipschitz 
condition with constant L, we can draw the following conclusion. 


TueoreM 10. Jf the input function G[u] satisfies a Lipschitz condition on the interval 
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0<u<1+ «, for any « greater than zero, then the associated surface temperature function, 
y(t), 7s non-decreasing for all t greater than zero. 

This theorem is a simple consequence of Theorems 8 and 9, and the proof will be left 
to the reader. We shall merely indicate here one way in which any input function, G[u], 
can be approximated on [0, 1 + e] by an analytic input function. Define H[u, d] as 


follows 


dh [” Gifu + €] ss G, [x] 
u, Ar = 5 = lt = 7 ee lx 
H 7 » & +r ws i aqg—-vw +r 
| ax/2 
a | Gifu + » tan 6] de, 
T J—x/2 
Gil + e] when u>i1+te, 
Glu] = ( Glu] when O<sucslte, 
G[0] when u <0. 


It is easily verified that H[u, \] is continuous in X, that lim,» H[u, 4] = G,[u] and hence 
that the convergence is uniform on the closed interval [0, 1 + e]. By examining the differ- 
ence H[u, , 4] — H[u, , A] it is evident that the monotony of G, on [0, 1 + €e] implies 
that of H, and it is obvious that if G, satisfies a Lipschitz condition with constant L, 


the same is true of H. Furthermore, H[u, \] is analytic in wu. 

THeoreM 11. Jf the input function Glu] satisfies a Lipschitz condition on an interval 
[0, 1 + «] for any « greater than zero, and if y(t) is the corresponding surface temperature 
function, then y(t) < 1 for all t > 0, and lim, y(t) = 1. 

Assume that y(t;) = 1. Then since by Theorem 10 y(t) is non-decreasing, and since 
0 < y(t) < 1 for all ¢ > O, it follows that y() = 1 for all t > ¢, . Let ¢, be any value 


of ¢ greater than ¢, . Then 


ie G[y(7) 
y(t 172 ily 7”) i73 dr I 
T (¢, wr Ts 
ies G[y(7) | the Gly(r)| 
y(t : dr - ; 7 : 
Yor | =" 1, — 7)? + I g/t, — 7)! 3dr 


But since y(#) = 1 fort; < t < #, , it follows from Hypothesis B that G[y(r)] = 0 for 


t; < r+ < t,. Therefore the last equation reduces to 


nt . at in ‘ 
‘ Gly(7)] ‘_ Gly(7)] 
y(t, ) = | rt, — 1)? dr <S / x(t, — dr == i, 
since both integrands are positive for 0 < ¢ < ¢, and ¢, > ¢, . This last conclusion that 
y(t.) < 1 contradicts the earlier implication that y(¢) = 1 for all ¢ > ¢, . Hence our 
assumption that y(t,) = 1 is impossible and we have that y(t) < 1 for allt > 0. 
In order to prove that lim,.. y(t) = 1, assume that this is false. Then by the non- 


decreasing nature of y(t) we infer that there exists some number 6 > 0 such that y(t) < 
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1 — 6 for allt > 0. By Hypotheses A, B, and C this means that G[y(t)] > G[1 — 6] > 0 
for all t > 0. This means that 


_ Gi — 6] ~ G[l — 4] 1/2 
WO > |, FAG o”6—tCi‘P 


“0 


But the right-hand side of the above inequality can be made arbitrarily large by taking 
i sufficiently large. This contradicts the fact that y(t) < 1 for all t > 0. Hence 6 does 
not exist and therefore lim,... y(t) = 1. This completes the proof of Theorem 11. 

In view of this last result it is not hard to see how the hypotheses in Theorems 6, 
7, 8, 10, and 11 can be somewhat weakened. Let G[u] be any input function which satisfies 
a Lipschitz condition on the closed unit interval [0, 1]. Clearly, G[u] can be continued 
outside this interval as an input function in such a way as to satisfy a Lipchitz condition 
in the large. One such extension is shown below: 


l—u when a i, 
G,[u] = (G[u] when 0<u<1l, 


G[0] — u when u <0. 


By Theorem 6 there is a unique surface temperature function, y(d), corresponding to 
G, , and y(t) is independent of the behavior of G, outside the unit interval. Since, by 
Theorem 11, y(t) < 1 for all ¢ > 0, the uniqueness proof in Theorem 6 can be carried 
thru without regard to the behavior of G, outside the unit interval. Hence, the Lipschitz 
condition assumed in Theorems 6, 10, and 11 need hold only on the unit interval. 
Similarly, it is sufficient in Theorems 7 and 8 to require that G be analytic on the open 
interval (0, 1) and satisfy a Lipschitz condition on the closed interval. 

4. Summary. Recalling that the input function, G[U(0, ¢)] is simply the product of 
the film transfer factor, f[U(0, t)] and the term [1 — U(0, t)]/k we can summarize as 
follows the results which we have obtained: 

Conc.usion 1. For any film transfer factor of physical significance, the heat transfer 
problem stated in Sec. 2 always has at least one solution, U(z, ¢), for all > 0, ¢ > 0. 
It can be constructed in the following way 


, . f' MG.9) _-# 
U(x, t) = |, F720 — 9)” exp u- 3 dr 8) 





from the surface temperature function U(0, t) which must be a bounded solution of the 


following nonlinear integral equation 


‘ gu, z 
U(0, t) = [ ae Mn dr. (6) 


ry 





Equation (6) always has at least one continuous bounded solution which satisfies the 
inequalities 0 < U(0, t) < 1 for all ¢ > 0, and having the property that U(0, 0) = 0. 
From these inequalities it follows by applying the mean value theorem to equation (8) 
that 0 < U(z, t) < lforallz >0,t> 0. 


Concuiusion 2. If f[U(0, #)] satisfies a Lipschitz condition on the closed unit interval, 
then we can add to Conclusion 1 that 
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a) U(O, #), and therefore U(a, #), is unique. 
b) U(0, t) is non-decreasing for all ¢ > 0. 
ec) U(O,t) < 1forallt > 0. 

d) lim,.. U(0, ¢#) = 1. 


Conc.iusion 3. If the film transfer factor is analytic on the open unit interval and 
satisfies a Lipschitz condition on the closed unit interval, then we can add to Conclusion 


2 that U(0, ¢) is analytic and monotone increasing for all ¢ > 0. 


Conc.usion 4. If the film transfer factor satisfies a Lipschitz condition on the closed 
unit interval, then U(0, ¢) can be approximated uniformly, arbitrarily closely in the 
large by the method of successive approximations defined in formula (14). Since the 
successive approximations lie alternately above and below U(0, ¢), an upper bound for 
the error in the nth approximating function is simply the difference | U,(0, t) — U’,_, (0, £) | 

As a final remark we point out that the methods developed here in the treatment of 


the surface temperature equation, 


G[y(7)] (a 
y(t) = | int — n]@ dr (6) 
are also applicable to a much more general equation, 
at 
y(t) = | K(t, 7)G[y(7)] dr (33) 


in which the kernel, A(t, 7), need only be positive definite and satisfy appropriate in- 
tegrability conditions. The theory for the more general equation (33) will be developed 


in a later paper. 
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A NOTE ON THE EXISTENCE OF A SOLUTION TO A PROBLEM OF STEFAN* 


BY 
G. W. EVANS If! 


Institute of Mathematics and Mechanics, New York University 


When certain metals are heated slowly, the temperature rises until it reaches a 
critical temperature at which the structure of the metal changes from one crystalline 
form to another. As for example, iron changes from a to 8 crystals at 1643°F. Ac- 
companying this change of crystalline form is a latent heat of recrystallization. In 
order to study the process we investigate the associated mathematical problem, which 
requires the solution of a partial differential equation in a region with an undetermined 
boundary. Our analysis establishes the existence and uniqueness of the solution. In a 
previous paper’ this problem is treated from the point of view of computing the solution. 

Suppose a metal slab having two infinite parallel faces is brought uniformly to the 
critical temperature and then heated by a uniform source covering the front face while 
an insulator covers the back face. Under these conditions, the new crystals are first 
formed at the front face, and the interface between the new and old crystals travels 
from the front face to the back face. Mathematically the problem can be stated as 


follows, where u = 0 is taken as the critical temperature: Find the temperature, u = 
u(x, t), and the curve, x = x(t), which satisfy the following conditions 
Up = A Use for0 < x < 2x(2) (1) 
u =O for x = x(t) (2) 
—Ax'(t) = u,[x(d), ¢t] where A > 0 (3) 
x(0) = 0 (4) 
u(0, ) = —g (5) 


where g is a constant >0. 
In this notation 


u, = du/dt, Uz, = 0'u/dz’, x(t) = dx(t)/dt, 


a’ is the coefficient of thermal diffusivity, A = pH/k where p is the density of the metal, 
H is the latent heat of recrystallization, and k is the coefficient of thermal conductivity. 
We simplify the notation by introducing new variables as follows: 


v(y, 7) = u(z, t)/ Aa” 
y = gx/Ad (6) 


g t/A 2? 


be | 
I 
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and, then, by renaming v by u, 7 by t, and y by z, the mathematical statement of the 


problem is: 


Find u = u(x, t) and x = x(t) where the temperature u(z, t) satisfies the following 
equation 
Use = Uy for0 < x < x(t) (7) 


with the boundary conditions 


u=0 for x = x(t) (8) 
a(t) = —u,[z(d), ¢] (9) 
2(0) = 0 (10) 
u,(0, t) = —1. (11) 


In this discussion we will use a theorem of Dr. Louis Nirenberg® on the parabolic 
equation. For the requirements of this paper, a restricted statement of this theorem is 
given below as our principal lemma. 

Lemma 1: Let R be a simply connected region in the z,f-plane where 0 < t < T with 
a part of the boundary of R being t = T and the remaining part of the boundary being 
given locally by a curve x = A(t). Furthermore, let u(z, t) be a continuous and bounded 
solution’ of the heat conduction equation having continuous derivatives satisfying 
Uz, = wu, in the interior of R, and let the solution be continuable into the region for 
which ¢ > T. If u(z, t) assumes its maximum or minimum in R, say at a point (&, 7), 
other than at a point of the boundary x = A(t), then u is a constant in the subregion 
described as follows: the subregion consists of all points of R which may be reached by 
a continuous curve t = f(s), x = g(s), where t = f(s) is a monotonic nonincreasing 
function of s, starting from any point in FR that lies on the line ¢t = +. 

Furthermore, we assume the following two lemmas’: 


Lemma 2: There exists a bounded solution, u, of u,, = u, with bounded continuous 
first derivatives in the interior of the region 0 < t < T,0 < x < X(t) where X(t) > 0 
is a curve with X(0) = 0, and uw assumes the following boundary values: u,(0, t) = —1 
and u[X(é), t] = 0. 

Lemma 3: There exists a bounded solution, u, of u,, = u, with bounded continuous 
first derivatives in the interior of the region 0 < t < T,0 < x < X(t) where X(/) > 0 
is a curve with X(0) = 0, and u assumes the following boundary values: u,(0, t) = 0 
and u[X(t), t] = u(t) > 0. 

We now establish the existence of the solution x = x(t), u = u(z, t) of the problem 


given in (7)-(11). The proof consists in applying an iteration scheme to the equation 


z(t) 
a(t) = t— [ u(x, t) dx. (12) 


’This theorem is still to be published ; but a similar theorem which is not as general, but which would 
be satisfactory for this paper, was proved by Mauro Picone (Sul problema della propagazione del calore in 
un mezzo privo di frontiera, conducttore, isotropo e omogeneo, Math. Ann. 101 (1929)). 

‘The problems of Lemmas 2 and 3 are of the type which were considered in a more general way by 
W. Sternberg (Uber die Gleichung der Warmeleitung, Math. Ann. 101 (1929)). 
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This equation is derived by evaluating 
t z(t) 
[ [ (u,, — u,) dxdt = 0 
“0 “0 


with the boundary conditions (8), (9), (10), and (11). Our procedure determines x(t) 
and u(2, t) in the following way: let 


2,(t) = t— “<< un—\(a, t) dx (13) 


“0 


with u,.,(z, t) the solution of Lemma 2 for X(t) = z,-,(t). This procedure has been 
chosen because, after it has been shown that u(z, t) = Lim,.. [u,-, (2, t)] is the solution 
of Lemma 2 with X(t) = 2(t) = Lim,... [z,(], we may differentiate Eq. (12) with 


respect to ¢ and find that 
u,[x(t), t] = —2°(0) 


which is the boundary condition (9) of our problem not contained in Lemma 2. 
The z,(t) for n = 0, 1, 2, 3, --- are monotonic non-decreasing functions of ¢, i.e. 


z(t) > 0. 
This may be seen by differentiating Eq. (13) with respect to ¢ giving 
x(t) = —Un-1,[%n-1(8), #]. 


And, it remains to show that w,-,,[z,-,(0, t] < 0. By Lemma 1, u,-,(z, ¢) must have 
its maximum and minimum value along x = 0 since u,-_,(2, t) = C cannot satisfy the 
condition u,_,,(0, 2) = —1. Furthermore, one can show 0 < u,_;(0, t) < M where M 
is the upperbound of u,_,(z, t) of Lemma 2; and since u,-,[z,-,(t), t] = 0, then 
Un 1el2awa(é), i} < 0. 

To show that 0 < u,(z, t) < M for n = 0, 1, 2, 3, --- , form »,(2z, t) = u,(z, ) + 2 
where wu, satisfies the conditions of Lemma 2. v,(z, t), then, satisfies the equation 


Uneo(2; t) - 0, (2, t) (14) 
with the boundary conditions 
v,,(0, t) = 0 and v,{z,(t), t] = 2x, (2) 


and is a solution of Lemma 3. Since v,,(0, t) = 0, we reflect the solution about z = 0 


giving the boundary conditions: 


v,{x,(t), t] = x,(t) and v,{—2,(t), ] = 2,(@). 


Applying Lemma 1 to »,(2, é) in the region between the two curves x = —z,(t) and 
x = x,(t), we see that v,(z, t) must assume its maximum and minimum along x = 2,(é), 


i.e., v,(xz, t) ¥ C since v,[z,(t), t] = 2,(t) ¥ C. For any given t = + 
0 < »,(z, 2) < maximum [7,(2)] for 0O<t<r 


or 


lA 
lA 
4 


—z < u,(z, t) < maximum [z,()] — x for 0 



















188 G. W. EVANS II [Vol. IX, No. 2 


But, u,(2, }) is defined only for « > O and assumes its maximum and minimum along 
x = O, therefore 
0 < u,(z, 2) < maximum [7,(t)] — a for 0.0% -. 


Using, now, the monotonity of x,(t), we have 


os ws, t) ~ ete) — 2. (15) 
To show that the Lim,,... [z,(4)] exists, it is sufficient to show that 
| Zasn(l) — 2, (8) > 0 as no 
uniformly in ¢ for sufficiently small ¢, say all t < 7. For this, we choose zx,(1) f, then 


at 


z(f) =t | Uy(a, t) dx 


/ 


and 
at 


z(t) — 2(@) = | uo(x, t) dx. (16) 
From Eq. (16) and the inequality (15), z,(@) is seen to lie to the left of x(t) {in the 
z,t-plane where the x-axis is taken in the horizontal direction. Similarly, 


at 


z,(t) z,(t) | [u,(x, t) u(x, t)] dx + | u(x, t) dx > 0 (17) 
J2,(t) 


“0 


(t 


if we can show u,(z, t) — w(a, t) < O. Let 
v,(z, t) = u,(z, t) — u, (2, 0), 
then v,(z, t) satisfies the differential equation 
V,,,(%, t) = v,,(z, t) 


with the boundary conditions: 
y,. (0, = 0 and viz (0), t) = ulz,(b, 4] > 0. 


By Lemma 3 a solution exists to this problem; and since »,,(0, t) = 0 we can reflect 
the solution about 2 0 thus making v, (x, t) satisfy the alternate boundary conditions: 
v,[2,(0), ¢] u,[x,(t), t] > 0 and v,[—z, (0), t] = u[z,(0, t] > O. 


By applying Lemma | with these boundary conditions, v,(z, t) must assume its maxi- 
mum and minimum along z,(¢); and since v,(z, t) # C, v,(2, t) > 0. By the expression 
(17), 2.(¢) must lie to the right of z,(t); and by considering 
exr,(t) 
x(t) —2(t) = — | u,(x, t) dz, 
x,(t) must also lie to the left of z(t) = t. In fact, by replacing 2x,(t) by 2,(¢) in the pre- 
= 2, 3, 4, --- , must lie between x(t) and x,(t). By 


ceding argument, all z,(t), for n = 2, ¢ 
continuing this process 
ri(t) ra(t) 
z(t) — 2,(t) = — [ [u.(x, t) — u(x, 2) dx — / u(xz,t)dx <0 (18) 
ri(t) 


“0 
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since 
v,(z, f) = u,(z, t) —u,(z, 2) > 0; 
and 
ta(t) t 
a(t) — 2,(t) = — [ [u(x, t) — u(x, t)] dx + [ u(xz,tdx>0. (19) 
“0 “2ra(t) 
The expressions (18) and (19) show that 2,(é) lies to the left of x,(t) and to the right 
of x,(4). By further continuing the process it is seen that 
Lor-i(l) < Tull) S Leu-1)(b) 


and 
Tor-ill) S Loess (b) < Xo, (b) 


for k 1,2,3,---. That is, each curve z;(t), i = 2, 3, 4, --- , lies in the region between 


9 Ys 
the preceding two curves, x;,_,(t) and 2,-_,(t). 
To establish the existence of the Lim,.. [z,(é)], consider the difference, as defined 


by Eq. (13) of 


tat) Zn-1(t) 
Invill) — 2,(t) = — [ u,(x, t) dx + [ u,—\(x, t) dx. (20) 


“0 “0 


If x,(t) lies to the right of z,_,(0), then Eq. (20) becomes 


tn—1(t) 


Tn—1i(t) 
Inai(t) — 2,(t) = [ [u,-(x, t) — u,(x, t)] dx + [ u,(z, t)dx, (21) 
“0 “2n(t) 
and if z,(t) lies to the left of z,_,(t), Eq. (20) becomes 
rn(t) tn-i(t) 
In+i(t) — 2,(t) = [ [u,—:(a, t) — u,(x, t)] dx + [ Un-(2, t) dx. (22) 
/0 W2rnl(t) 


To estimate the absolute value of the difference in Eq. (21), choose a time, say t = T, 


and by using the expression (15), Eq. (21) becomes 


| tn4i(7) — 2,(T) | = | [ ie [u,-.(z, T) — u,(x, T)] dx 
(23) 
tn—-1(T) | 
+ [ ude, 7) dz |. 
“2zn(T) | 


Since u,(z, T) < T (for 0 < u,(z, T) < u,(0, T) < T), by substituting T for u,(z, T) 
in the second term of the right hand side of Eq. (23), the left hand side satisfies the 


following expression 


tn-1(T) 
| zu(T) — eT) |< [| f(a, 7) — real, TH] | + | de | 
ss (24) 


+7 |2,(T) — 2,-(T) |. 
To estimate the integral in the inequality (24), form 


vo(z, t) = u,(z, 2 — u,-:(2, 8 
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which satisfies 
with the boundary conditions 
v(0, t) = u,,(0, t) — u,-1,(0, t) = 0 
and 
viz), t= aie. -.(f); t]) > 0. 


> 


By Lemma 3 a solution exists and since v,(0, t) = 0, reflect the solution about x = 0, 
thus making v(2, t) satisfy the alternate boundary conditions 


v[zn-i(t), t] = u,[r,-1(t), t] > 0 


v[_—2z,-:(t), t] = u,[z,-,(d), t] > 0. 


Now, by applying Lemma 1, v(z, 4) must assume its maximum along z,-,(t) since 
v(a, t) # C. But, by the inequality (15), along x = z,_,(0), 


Un{tn-1(t), t] — ups [2,-,(0), t] = u,[2,-,(0, t] < 2,(t) — xn-, (0). 


Therefore 


| u,(z, 7) — u,-;(z, T) | < | 2z.(T) — 2-:(T) |. (25) 
The inequality (24) now becomes 
ern—1(7') 
| tnoi(7) — 2,(T) | < | 2,(T) — 2,-.(T) | | |de|+T7|2,(T) — x,-,(T) | 


< 27 | x,(T) — 2,-1(T) | 


since z,.,(7') < T. Now choose 7, = 4, then fort = T < } 
4 4 


| a1 (t) — x, (t) ' < 3 | %(t) i Ln-1(t) |. (26) 


The result (26) may be deduced in a similar manner from Eq. (22). The inequality (26) 
establishes the existence of the Lim,_.. [z,(¢)] = a(t) fort < }. 


Since for ¢ < } one has already shown that | z,,,(t) — x,(t) | may be made less than 
efor n > N(e), and since by the expression (25) 


| u,(z, t) — u,-:(z, 2) | < | 2z,(0) — 2,-:( |, 


the Lim,,... [u,(z, t)] = u*(z, t) exists for t < }. 


Since it is clear that the limits of the iterations satisfy the integral equation (12), it 
remains to show that u*(z, 2) satisfies the differential equation (1) with the boundary 
conditions of Lemma 2. To do this let u(x, t) be defined as the solution of Lemma 2 
for x = x(#) where x(t) = Lim,,... [z,(¢)]. Set 


axr(t) 


z*(t) = t-— | u(x, t) dz, 





1951 | EXISTENCE OF A SOLUTION TO A PROBLEM OF STEFAN 





and form the difference 


z(t) 


Tn—-1(t) 
2*(t) — z,(t) = [ [u,—.(a, t) — u(x, t))] dx — [ u(x, t) dx. (27) 


Vrn—1(t) 
Next, consider the limit of Eq. (27) as n —@, but choose the sequence through which 
n varies so that x,_,(t) always lies to the left of x(t). Then 
eIn-il(t) 


Lim [x*(t) — 2,(é)] = Lim {| 


“0 


[un—.(x, t) — u(z, t)] az}; 


so set 
—v,~-;(z, 2) = u,.1(z, 2 — u(z, d 
which satisfies the equation 
Va-te.(2, 8) = Ug-1,(2, 8) 
with the boundary conditions 
v,-1,(0, 2) = O and Vn—1[2n-1(0), t] = ulz,-_, (8, t] = fd. 
Therefore v,-;(z, t) satisfies Lemma 3. Since v,_,,(0, t) = 0, reflect v,_,(x, t) about x = 0, 
then v,-_,(x, t) still satisfies the same differential equation with the alternate boundary 
conditions 
Va-i[2n-1 (8), t] = fd) and v,-:(—2,-, (2), ] = f(. 


By Lemma 1 either v,_,(z, t) = 0 since v,_,(0, 0) = 0 which is what we desire, or the 
maximum or minimum of »v,_,(2, t) lies on x = 2z,_,(t). If the second condition is true, 
let n tend to infinity in the prescribed manner, then 


Lim {v,_,[z,-,(t), t]} = Lim {—u[z,_,(d, §} = —ulx(d, t] = 0. 


This implies that x*(t) = 2(¢) which implies that u*(z, t) = u(a, t) or that u*(z, 4 
satisfies the differential equation (1). 

In verifying the solution of the problem, one must show that dx (¢)/dt exists where 
a(t) = Lim,,... [x,(f)]. This is easily shown by the definition of a derivative and with 


the aid of Eq. (12): 








dx(t) _.. | = + At) — 2(0 
eae At (28) 


a(t+ At)— a(t) t+ At—t  fo"**? u(x, t+ Ad dx — fo"? ula, dx 
t . t At 








_ pO | alu, t + At) — u(z, a] ms | ae u(x, t + Ad) 
a I | At wmf at (28) 


ulg(At), t + Ad] p7"*4? “. 
a z(t) 





“so [ u,[x, t + O(x) At] dx — 


“a 
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where 0 < @(x) < 1 and a(t) < ¢(At) < a(t + Al). ulg(Ad), t + Al] is a mean of the 
values u takes as it varies from u[x(t), t + At] to u[a(t + Ad, t + At]. Using the last 


expression above, the left hand side of Eq. (28’) may be written as 


a(t + At) — a(t) 1 — fi’ uw lx, t + O(a)At) dx (29) 
At 1 + ulp(At), t + Ad] ii 


The integral on the left hand side of Eq. (29) may be evaluated as follows: 


t) ox(t) 


| wx, t+ O@)At}dx = | — waelx, t+ O(2) At] de. (30) 


“0 “0 


Since u,,.(z, ¢) is continuous and bounded for all finite t > 0 and 0 < 2 < 2(t) we may 


write Eq. (30) as 


px (t) 


(u,.(a, t) + e] dx = u,[x(0), t] — uO, 2) + ex(d). 


i 


| wile, t+ O(@) At] de 


“0 “0 


By using the continuity of u,,(z, t) and by considering a finite region 0 < x < 2x(0), 
0 <t < 7%, the term ex(t) may be neglected since x(t) < 7*. Since u(z, é) is continuous 
in the same region as w,,(2, t), the 


Lim {u[¢(Ad), ¢ + Ad]} = ufzx(d), t] = 0. 


Therefore, we may let At — 0 on the right hand side of Eq. (29) to obtain 


dx(t) ‘ 
——- = —u,[z(0), ?t]. (31) 
dt 
To show the uniqueness of = 2(t) = Lim,... [x,(], assume two solutions of our 
problem, U(x, t) producing the curve x = X(é) and V(z, t) producing the curve « = Y(t). 
If X(t) = Y(é) then it is already known that U(z, t) = V(a, t);? so assume X(t) > 


Y(t) fort > 0. From Eq. (12) we have 
aX(t) 


X()=t-| U(z,tde 


/0 


a Y(t) 
Yj)=t-—| V(x, daz. 
Form the difference 
»¥(t) »X(t) 


X(t) — Y(t) = [V(z, t) — U(a, td] dx - U(x, t) dx. (32) 


“ Y(t) 


Since by Eq. (15), U(z, t) > 0, set W(x, t) = V(x, t) — U(a, t) which satisfies the 


equation 
W,.(z, t) = W,(z, (33) 


with the boundary conditions 
W,(0,t) = 0 and W{[Y(), 9 = —Ul[Y(, 4 < 0. 


‘L. Bieberbach, Differentialgleichungen, Dover Publications, New York, 1944, pp. 391-392. 
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Since W,(0, t) = 0 reflect this solution about x = 0 so that W(z, ¢) still satisfies Eq. 
(33) but with the alternate boundary conditions 


W(-Y, ] = —U[Y(,4 < 0 
and 
W([Y(), J = —U[Y, < 0 


Applying Lemma 1 to W(z, t) we see that since it cannot satisfy W(x, t) = C, its maxi- 
mum value is at (0, 0) and 


W(x, t) < 0. 


By forming the difference. 


Y(t) X(t) 
x~-YoO=[ Weda-[ U@,ddr <0, (34) 
“n “Y(t) 
Eq. (34) contradicts our assumption that X(t) > Y(t). Since X can be replaced by Y 
and U by V in the above argument to give a contradiction on the assumption that 
Y(t) > X(é), then X(t) = Y(t). The uniqueness of our solution is shown under the 
assumption that X(¢) and Y(t) do not intersect infinitely often as t — 0. 
The author wishes to thank Dr. L. Norenberg and Dr. Y. W. Chen for their sug- 
gestions and especially to thank Dr. E. Isaacson for proofreading this note and offering 
improvements in methods and organization. 











—NOTES— 


THE CONSERVATION OF SYSTEMS IN PHASE SPACE* 
By ERNEST IKENBERRY (Louisiana State University) 


1. Introduction. Important progress in the development of a statistical theory of the 
transport phenomena in liquids, based on the application of Gibbs’ principle of ‘“con- 
servation of density-in-phase,”’ has been made in recent papers by Kirkwood,” 
by Born and Green,* and by Jaffé.* Kirkwood and Born and Green have derived the 
Maxwell-Boltzmann integro-differential equation and have developed a general statistical 
mechanical theory of transport processes, by application of the principle of continuity 
in phase space. In their method of treatment of this principle, Born and Green defined 
a set of multiform distribution functions and obtained a generalized equation of motion 
referring to a cluster of / molecules. This generalized equation reduces to the equation 
of motion of ordinary hydrodynamics when h = 1. Born and Green also obtained ex- 
pressions for the coefficients of thermal conductivity and of viscosity, but have not 
published any numerical results. 

Using a method analogous to that introduced by Boltzmann’ in the kinetic theory 
of gases, Jaffé’ obtained a solution for the distribution function in the form of a multiple 
power series proceeding according to powers of the kinetic energy and resultant mo- 
ments. Previously determined® potential functions were used to calculate theoretical 
values for the coefficients of thermal conductivity and of viscosity of ten liquids. The 
numerical results obtained agree reasonably well with the observed values, in most 
instances. 

In each of the three methods of treatment the consideration of boundary con- 
ditions has been almost completely avoided. Kirkwood and Born and Green have sug- 
gested making the distribution function vanish at the boundaries, and Jaffé limited his 
considerations to the neighborhood of a particle in the interior of the liquid. However, 
it would seem that, since the coefficients of heat conductivity and of viscosity are 
defined in terms of the transfer of thermal energy and of momentum through boundary 
surfaces, the theory of these dissipative processes cannot be considered as sufficiently 
well developed without the further consideration of boundary conditions. In our treat- 
ment of the principle of conservation of density-in-phase, the boundary conditions to 


*Received June 5, 1950. The contents of this paper form the first part of the author’s doctoral dis- 
sertation (May, 1950) at L.S. U. The second part of the dissertation includes the calculation of numerical 
values for the coefficients of heat conductivity and of viscosity for ten representative liquids, by means of 
a multiple Bruns’ series expansion of the distribution function and by an equivalent “generalized mo- 
mentum integral’ method. 

1J. Willard Gibbs, Collected Works, Longman, Green & Co., New York, 1931, Vol. II, Part I, “Ele- 
mentary Principles in Statistical Mechanics,”’ Ch. 1. 

2J. G. Kirkwood, J. Chem. Phys. 14, 180 (1946). 

8M. Born and H. S. Green, Nature 159, 251 (1947); Proc. Roy. Soc. London (A) 188, 10 (1946), and 
subsequent papers. 

4G. Jaffé, Phys. Rev. 69, 688 (1946), 75, 184 (1949). 

5... Boltzmann, Vorlesungen tiber Gastheorie, Johann A. Barth, 1896, 1. Theil, ‘“‘Theorie der Gase mit 
einatomigen Molekiilen, deren Dimensionen gegen die mittlere Weglinge verschwinden,”’ p. 184. 
6G. Jaffé, Phys. Rev. 62, 463 (1942). 
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be applied refer to the boundaries of the region of phase space occupied by the systems 
of a virtual ensemble, rather than to the boundary condition problems as treated in 
hydrodynamics. The formulation of boundary conditions in phase space will be preceded 
by the statement of what may reasonably be assumed regarding boundary conditions in 
real space, in the most general case of a fluid contained in a finite volume V with moving 
boundaries. 

A fluid system bounded by the walls of a container consists always of the same 
particles, but consideration of such a system is complicated by the presence of a boundary 
region of finite thickness where the particles of the fluid are in interaction with the 
molecules of the material comprising the walls. We may simplify the boundary condi- 
tions, in some respects, by considering a system which is entirely surrounded by a larger 
system of the same kind. Due to diffusion, the boundaries of the real system would 
become ill-defined if we were to require that the real system is to always consist of 
the same particles. We may, however, define the motion of the boundaries of the real 
system in such a way that, during any reasonably short interval of time, there is no 
mass current (see Eq. (2.12)) through the boundaries. During such an interval of time, 
there is no net diffusion of particles through the boundaries of the real system. 

The analogy between the hydrodynamical fluid in real space and the imaginary 
fluid consisting of systems of the virtual ensemble in phase space may be extended by 
conceiving of a diffusion of the systems of the virtual ensemble. In order that the 
boundaries of the virtual ensemble should not become ill-defined, it is sufficient that 
the boundaries of the virtual ensemble should be defined in terms of the previously 
defined boundaries of the real system in real space. The boundary condition to be 
applied is that, during any reasonably short interval of time, there is no net diffusion 
of systems through the boundaries of the region of phase space occupied by the systems 
of the virtual ensemble. During such an interval of time, both the number of part.cles 
in the real system and the number of systems in the virtual ensemble remain constant. 

In the present paper, boundary conditions of this very general nature will be applied 
in the derivation of an equation of transport analogous to the equations of transfer 
derived by Maxwell’ in his kinetic theory of gases. The ordinary equation of continuity, 
the hydrodynamical equations of motion, and the equations of thermal and of total 
energy will then be obtained as special cases of this general transport equation. 

2. Preliminary definitions. We are considering a system of WN identical particles, 
each of mass m. The state of the system at any instant is determined by the values of 
the 3.N position coordinates (x, , y. , 2.) and of the corresponding momenta (m(dz,/dt) , 
m (dy,/dt) , m (dz,/dt)) of the N particles. Let 


Qsa-2 =e, Qo-r=Yo, Yo=%, a@=1,2,---N, (2.1) 
and 
dq; : : 
n=m—, a= 1,2, --- 3N. (2.2) 
dt 
Then the 6N values of the gq; and the p; at any instant determine a point in a 6N di- 
mensional phase space. This point is representative of the instantaneous state of the 


7J. C. Maxwell, Phil. Trans. 157, 1 (1886), or Collected Works, II, p. 26; J. H. Jeans, The dynamical 
theory of gases, The Univ. Press, Cambridge, 1925, Ch. IX. 











ERNEST IKENBERRY 197 





1951 


system, and the progress of the system in time is represented by the motion of its 
representative point along a path in phase space. Following a method developed by 
Gibbs,’ we consider a virtual ensembie of N such systems, each consisting of N identical 
particles, all of the systems being subject to the same internal and external forces, but 
distinguished in that they are distributed among the various states accessible to the 
actual system. In general, the ranges of the position coordinates g; are determined by 
the physical volume occupied by the particles of the actual system. The ranges of the 
momentum coordinates p, are limited by the maximum kinetic energy which the real 
system may have, but to avoid mathematical difficulties, we will follow the customary 
procedure by taking all real values for the ranges of the momentum coordinates 
Letting 


3N 


dQ, = || dq; dp, (2.3) 


t=1 
be the element of extension in phase space, we write 
dN = f(q, p, t) dX (2.4) 


for the number of systems in dQ, at time ¢. The distribution function f, which gives 
the instantaneous density of systems in phase space at the point (q, p), is in general a 
function of the 6N + 1 variables gq, p, t. This function is subject to certain conditions 
analogous to the conditions first formulated by Hilbert® for the distribution function 
of a gas in position-velocity space. In a strict sense, f is not a continuous function of 
the 6N + 1 variables. Nevertheless, it is assumed that the density of systems in phase 
space is so great that f can be approximated to a sufficient degree of accuracy by a 
function which is continuous and has continuous derivatives with respect to the phase 
coordinates and the time. Since a density is of necessity a positive quantity, the dis- 
tribution function must not become negative within the ranges of these 6N + 1 variables. 
The distribution function must also be symmetrical in the phase coordinates of any two 
particles, inasmuch as any one of the N identical particles may be chosen as the repre- 
sentative particle. It is further assumed that the distribution function vanishes suffi- 
ciently rapidly, for large values of the momentum coordinates, to assure the existence 
of all integrals of the form { f¢ dQ, , where $(q, p, t) is any polynomial in the momentum 
coordinates with coefficients continuous in the position coordinates and in the time. 

The average value, (¢), of a function ¢ of the phase coordinates and of the time, 
for a specified instant of time and for a specified position of a representative particle, 
say particle No. 1, is defined by 


(6) f fdQ, = f fodaQ,, (2.5) 
where 
ao, = TL da Tap. (2.6) 


is the element of extension in the subspace Q, of Q, . We note that Q, is the section of 
2) obtained by assigning specified values to q, , gz , and q; . Selection of particle No. 1 
as representative particle means that, henceforth, (q; , 2 , Ys) may also be considered 





8E—). Hilbert, Math. Ann. 72, 562 (1912). 
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as the coordinates of a point in real space and need not be distinguished from (x, , y; , 2;). 
From (2.5) and (2.6) it may be understood that (@) is, in general, a function of the four 
variables g, , g2 , gx; , and ¢t even when ¢ itself does not depend explicitly on all four of 
these variables. This arises from the, in general, complicated manner in which the dis- 
tribution function depends upon the position coordinates and on the time. 
Letting 
dr = dq, dqz dq; (2.7) 


be the element of extension in real space, and defining dN, by 


dN, = dr / jf dQ, , (2.8) 
we may make the transformation to real space in the customary manner by writing 
oa (2.9) 
N N 


where dN is the number of particles contained in d7 at a specified instant of time. Hence, 
by (2.8) and (2.9), the density n of particles in real space is given by 


— “= x | f dQ, . (2.10) 
We note from (2.5) and (2.10) that 
/ ith-Lee (2.11) 
N 
Letting 7, 7 = 1, 2, or 3, the components mu, of the mass current, the components 


S,; of the strain tensor, the temperature 7’, and the components H, of the heat current, 
at a specified instant of time and at a specified point (q, , G2 , gs), are defined by 


mu; = (p;), (2.12) 
S.; = = (pip!) (2.13) 
“yo m PiPi He de 
3nkT = S,, + Soo + S33 ; (2.14) 
and 
He = 573 Willy)” + (ws) + (9)"!) (2.15) 


respectively, where pi = p; — (pi). 
3. The conservation of systems in phase space. If we define the operator D by 


dq: 8 , dp. 2 . 
"5 . y {i aq, * dt dp;)’ (3.1) 
the hydrodynamical equation of continuity in phase space assumes the form” 
(2) + 2) 
a 0. 39 
ae > on dt Op; \ dt (3.2) 


%Jeans, op. cit., p. 71. 
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The phase coordinates are connected by Hamilton’s equations of motion, 





dq: _ d¢ dp; __ —0¢ 7 Ba 
dt "a OD; : dt al 04; + Ws , 7 I, 2, , ae -— 

































where ¢ is the Hamiltonian function for the system and the F; are the components of 
non-conservative forces. Hereafter we shall, in most cases, designate the dp,/dt by R; . 
The forces whose components are the R; may be partly of internal, partly of external 
origin. In case each force component FR; is independent of the corresponding momentum 
coordinate p; , the 3N terms in the summation in (3.2) vanish and (3.2) leads to Gibbs’ 
principle of conservation of density-in-phase, which is valid for non-conservative as well 
as for conservative systems, so long as 0R,/dp; = 0. 

When there is no diffusion of systems in phase space, the vanishing of the summation 
in (3.2) is equivalent to Gibbs’ principle of extension-in-phase. This principle, together 
with Gibbs’ principle of conservation of density-in-phase, would then imply conservation 
of the total number of systems in the virtual ensemble. However, when there is diffusion 
of systems in phase space, the motion in phase space of the boundary surface S, of 
Q) is not determined by (3.3), and, in order to obtain conservation of the total number 
of systems in the virtual ensemble, we must supplement (3.2) by the boundary condition 
that there is no net flow of systems through the boundaries of the region of phase space 
available to the systems of the virtual ensemble. We proceed to give the mathematical 
formulation of this boundary condition, for the general case in which the boundary 
surface S, of Q is in motion in any arbitrary manner. 

For complete generality, we consider a function y = f¢ where ¢ is any polynomial 
in the momentum coordinates with coefficients which may be functions of the position 
coordinates and of the time. Then 0/dt f y dQ is defined as the limit, as At — 0, of the 
difference quotient of [ y dQ, evaluated at times ¢ and ¢ + At: 


2 | va, p, 1) d% = Lim > { [ van, t+ a) d%- | va,r,0 as.) (3.4) 
dt Jo, at+o At Jo, Q 

where 2, and / are the regions of phase space occupied by the systems of the virtual 
ensemble at times ¢ and t + At, respectively. Assuming that ¥(q, p, ¢ + At) may, for 
sufficiently small values of At, be expanded as a Maclaurin’s series in At, we obtain, to 
terms of the first order in At, 


[ vart+aodyn=[ var,od,+arf Mama, os 


Hence (3.4) becomes 


re) [ oy ao 

of vam = [ Yaa +Lims do 3.6 

Ot Ja, ro o, Ot ato At J o,-9, neat alias 
where 2, — Q is the region of phase space through which the boundary surface S, of 
Q, moves during time Af. Letting V,(n) be the component of the velocity of motion of 
the element dS, of the boundary surface S, of 2, , normal to dS, , we may write 





d % = Vo(n) d Sp At, (3.7) 
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where dQ, is the region of phase space through which dS, moves during time dt. Hence 
(3.6) becomes 


of te] : ‘ 
Sf yaqn= | %a,+/ wrmas,. (3.8) 
ot / Qo / 2 ot “So 
We now let »() be the component of the velocity of motion of a representative point 
at the surface element dS, of the surface S, of 2, , normal to dS, . By Gauss’ theorem 


for the transformation of a surface integral into a volume integral we may write 


, en p BN 9 ( a) a ( ip.) 
/. y(n) dS, = I p> a. v di + dp, v di dQ. (3.9) 


Upon combining (3.8) and (3.9), we obtain, by means of (3.1) and (3.4), 


a = wo #:) J (cee) | 
Fy; I y dQ, = I | Dy +¥D> oi (44 + ap. \dé dX% 


t=1 


(3.10) 
+ / — W{Voln) — vo(m)} dSp . 


8 

With y = f, we see that (3.10) states that the rate of increase of the total number 
of systems in the virtual ensemble is equal to the rate at which systems are produced 
in Q, , augmented by the net rate of flow of systems into 2) through its boundaries. In 
order to obtain conservation of the total number of systems in the virtual ensemble, 
we shall apply not only (3.2), but also the boundary condition that there is no net flow 
of systems through the boundaries of the region of phase space available to the systems 
of the virtual ensemble, so that the surface integral in (3.10) vanishes. We note that 
the surface integral in (3.10) may vanish without the integrand itself vanishing identic- 
ally, that is, without V,(m) = v.(n). For equality of these two generalized velocity 
components would imply that there is no transport of any quantity ¢ through the 
boundaries of the system. As a concrete example, we may consider the case of a fluid 
contained within rigid boundaries, with heat being applied at the boundaries. Then, 
to the extent that the walls are rigid, the boundaries of 2, must also be considered as 
fixed, so that V,(n) = 0. But, as there is transfer of heat into the system through the 
walls, the particles must, on the average, rebound from the walls with increased kinetic 
energy. This may be pictured in phase space by imagining that a system of the virtual 
ensemble flows out through the boundary of Q, at the instant a particle strikes the 
rigid wall and another system flows in through the boundary when the particle re- 
bounds from the wall, an infinitesimally short time later. 

4. The equation of transport. An equation of transport, including the general case of 
moving boundaries, may be derived directly from (3.2) by the imposition of a boundary 
condition in the 6N — 3 dimensional subspace Q, of 2 , analogous to the boundary 
condition previously imposed in 2, . Let V,(n) be the component of the velocity of a 
point on the element dS, of the boundary surface S, of 2, , normal to dS, , and let 
v,(n) be the component of the velocity of a representative point, in the subspace Q, , 
at the moment that point is on or passes through dS, , normal to dS, . By exactly the 
same procedure as previously used, we obtain, corresponding to (3.8), 
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9 _ ff day 
at J, YIM = J, gg em tT [ vV,(n) dS, . (4.1) 


Since Q, is the space of all 3NV momentum coordinates but of only 3N — 3 position co- 
ordinates, application of Gauss’ theorem for the transformation of a surface integral 
into a volume integral gives 


; aN dq; wv 9 dp; 
/ w,(n) ds, = [ { ps aq, (v aa) oe Ze ap, (v _ )} aa... (4.2) 


Ss “Q i=4 t=1 





We now supply on both sides of (4.2) the three terms necessary to make the indices in 
both summations to run from 1 to 3N, combine the result with (4.1), replace ¥ by f¢, 


and apply (3.2) to obtain 


of > oO Pi 
at A Ip en, + > dq; / f m ¢ dQ, 


(4.3) 
=| sdeay+ | ftv - n()} a8, . 


When each member of (4.3) is multiplied by N Ar/N, where Ar is any fixed element 
of volume in real space, the first term on the left hand side of the resulting equation 
then represents the time rate of increase of the total amount of ¢ in Ar, and the sum- 
mation represents the net rate of flow of the quantity ¢ out of Ar, through its boundaries. 
The first term on the right hand side, involving D¢, then represents the rate of increase 
of the total amount of ¢ in Ar, due to the explicit dependence of ¢ on the phase co- 
ordinates and on the time. However, V,(n) and v,(n) are generalized velocity com- 
ponents in the imaginary subspace ©, of phase space, and the surface integral over 
dS, , in the right hand side of (4.3), then represents the rate at which the quantity @ 
is transported into Ar, but not through its boundaries. The real physical space is a 3- 
dimensional subspace of phase space, and there are paths in generalized coordinate 
space which lead from the exterior to the interior of the closed volume in real space, 
yet which do not pass through the boundaries of the real volume. As the transport of 
any physical quantity from the exterior to the interior of a closed volume, by a path 
not passing through its boundaries, is experimentally unobservable, we require that 
the surface integral in (4.3) shall vanish. We thus obtain a first form of the equation of 


transport: 


0 9 i f 
2 | # dQ, + Dag, | fee ao, = | 108) dQ, , (4.4) 


1=1 

which is valid for any quantity @ whose value for the entire system is equal to the sum 

of its values for the individual particles, and for which the integrals involved exist. 

A second form of the equation of transport (4.4) is readily obtained by the aid of (2.11): 
a 1s 9a 

— — =— (np.o) = nD (4.5 

31 (%) + in Du Bq, (nd) = n{D¥), (4.5) 

since both N and N are constant. The right hand side of (4.4), and of (4.5), remains to 

be developed, for any particular ¢, by means of (3.1). 
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5. The equation of continuity, the equations of motion, and the energy equations. 
Letting ¢ = 1, we obtain from (4.5), by means of (2.12), 
= + > 7, is) = (5.1) 
t=] 


which is the equation of continuity. Elimination of dn/dt from (4.5) and (5.1) results 
in a third form of the equation of transport: 


3 
= ~ (5.2 
£@)+— oF z= * (np'd) = (D9), (5.2) 
where the operator d/dt is defined by 
d 0 0 ae 
Hoek > + > U; aq, . (5.3) 
Letting ¢ = p; , Dé = R; , in (5.2), we obtain the hydrodynamical equations of 
motion, 
a, 1 — 0S 
— (mu;) + - —— = (R;), = 1, 2, oré (5.4 
ay mei) eo > aq, (R;), J 1, 2, or 3, ).4) 


the components S,; of the strain tensor having been defined in (2.13). We note that 
the mean force acting on the representative particle may be non-conservative even 
when the forces acting on the individual particles are conservative. Conservative in- 
ternal forces may tend to increase or decrease the disordered motion of the particles 
composing the system, and this is interpreted, from the macroscopic point of view, as 
the effect of non-conservative forces acting on the parcels of the fluid, as in the ap- 
pearance of tangential surface forces when there are velocity gradients. 


In order to obtain the equation of thermal energy we may let ¢ = (1/2m) 5—?_, (p/)? 

in (5.2). By the aid of (2.12) to (2.15) and of (3.1) we obtain 
dad (3 )+3 dH; au; 1 : — 
- (3 kT) + - > ie <z > S;; = aa > (Rip'). (5.5) 


It is seen that external conservative forces, which depend at most on the position co- 
ordinates of the particle on which they act, and possibly on the time, make no contri- 
bution to the right hand side of (5.5). In the case of an ideal gas, whose internal energy 
is entirely kinetic, the right hand member of (5.5) vanishes. The resulting equation 
may then be compared with equation (13) of Enskog’s dissertation.’° The terms in the 
right hand member of (5.5) become negligible in the case of a moderately dense gas 
for which the internal forces decay sufficiently rapidly with distance. 

The equation of total energy may be obtained from (5.2) by letting @ = €, = 9,°' + 
¢:) + (1/2m) (pi + p2 + ps), where yg,” and ¢;" are the time independent external 
and internal potentials of the representative particle. The external potential ¢,"' will 
be assumed to be a function of the position coordinates q, , 72 , and q; of the representative 
particle only. Furthermore, only binary attractions and repulsions will be considered, 
so that the total potential of the entire system may be expressed as the sum of the po- 

TD), Enskog, Kinetische Theorie der Vorgange in mdssig verdtinnten Gasen, Inaugural Dissertation, 
Upsala, 1917, p. 18. 
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tentials of the individual particles. Upon substituting ¢, for @ into (5.2), multiplying 
the resulting equation by n dv, and integrating over dr at a constant time, we obtain, 
by the aid of (2.3), (2.6), (2.7), (2.11) to (2.15) and (3.1) to (3.3), and by considerations 


of symmetry,* 


ei ~ 9 : - Gi 
ne) dr + / ) Tg (n, + ie S..u;) dr + / } ns ‘2. (pie; »} dr 


dt e t=1 04: — 
(5.6) 
= [ z = (p.F;) dr, 
ime 
where 
(a) = 5 muh + ub + us) + of? + I”) + 3 kT. (5.7) 


Upon transforming the second and third volume integrals on the left hand side of (5.6) 
into surface integrals, we see that this equation may be interpreted as stating that the 
time rate of increase of total energy of the system is equal to the rate at which the non- 
conservative body forces F, and surface forces S;; do work on the system, augmented 
by the rates of transport of thermal energy of disordered motion and of internal potential 
energy through its boundaries. The importance of this latter portion of the energy 
flux, arising from the strong intermolecular forces in the case of a liquid, has been 
pointed out by Born and Green.” This flux is not contained in the expression (2.15) for 
the components of the heat current. 


*In the equation obtained by letting ¢ = a , in (5.2), we interchange the phase coordinates of the 
representative particle with those of each of the other N — 1 particles in turn, and add the resulting 
equations. Then, by considerations of symmetry and by the aid of (2.3), (2.6), (2.7) and (2.11), f nleaddr 
= (N/N) f fea dQ = (N/N) f fe dQ = f n(e:) dr. A similar method of treatment applies to the other 


terms on the left hand side of the combined equation. The right hand side of this equation be- 
comes (N/N f f(De) dQ), = (N/N) f f > sl (piF;/m) dQ, = N , (m/n)(piF';:) dr, by the aid also 


of (3.2) and (3.3) 
11M. Born and H. 8. Green, Proc. Roy. Soc. London (A) 190, 455 (1947). 


NOTE ON THE HAMEL-SYNGE THEOREM* 
By F. H. van DEN DUNGEN (Université Libre de Bruzelles) 
The theorem given by Synge’ for a plane motion of a compressible viscous fluid is 
easily extended to a three dimensional motion. 
Consider a compressible viscous fluid which moves inside a fixed closed surface B, 
on which the velocity vanishes. Our theorem is: A velocity v(x, y, z) is consistent with 
the foregoing boundary condition if and only if 


[ (A-curl v — f div v) dx dy dz = 0, (1) 


*Received June 16, 1950. 
'Q. Appl. Math. 8, 107-108 (1950) 
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where f is an arbitrary harmonic function and A a conjugate harmonic vector such that 
grad f + curlA = 0. (2) 
Proof. We have 


/ (A- curl v — v-curl A) dx dy dz = -|/ A-(v x n) dS 
Je é 
8 


(here n is the normal unit vector) and 


[ (f div v + v-grad f) dx dy dz = | fv-n dS, 


/V 
R 


and therefore it follows that 


[[ aw xa) + fom as (3) 
B 
vanishes if and only if Eq. (1) is satisfied. However, (3) will vanish for arbitrary har- 
monic f only if a continuous Vv vanishes everywhere on the surface B, and thus (1) is 
necessary and sufficient for the satisfaction of the viscous boundary condition. 


NOTE ON ITERATIONS WITH CONVERGENCE OF HIGHER DEGREE* 
By HASKELL B. CURRY (Pennsylvania State College) 
In a recent paper’ E. Bodewig has derived a general expression for a function F(x) 
such that the sequence 
Zn4, = F(z, (A) 


converges in a degree at least m in the neighborhood of every root of a polynomial f(z), 
provided the latter has only simple roots. A part of this argument can be simplified, 
while another part can be made somewhat more natural. 

In regard to the first point, the operator P used by Bodewig is the same as d/dy 
where y = f(x). Further, since his r is 1/f’, we have r = dx/dy = Px. Hence the identity 
between Bodewig’s formula (14a) and the Euler inverse Taylor expansion (15) follows 
immediately; it is not necessary to use the recurrence formula for the higher derivatives 
of an inverse function. 

In regard to the second point, if we have a sequence of functions F,(z), F.(z), 
such that the sequence (A) converges in the degree at least m for F = F,, , then any 
F(z) giving rise to a sequence converging at least in the degree m must be of the form 


F =F, +9,f”. 


*Received August 29, 1950. 
1E. Bodewig, On types of convergence and on the behavior of approximations in the neighborhood of a 


multiple root of an equation, Q. App]. Math. 7, 325-333 (1949). 
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Now F,,,, is itself such an F. Hence, by induction, we have, for suitable g, , 
F, = 2+ of + of + °*> + Ga-f” 


This suggests a change of variable to y = f(x) (which is possible since f’(X) ¥ 0). If 
x = u(y) is the inverse function, and 


®,(y) = Falu(y)], ely) = gelu(y)], 


then 
o, = ut hy + voy? + e+ + ny” (B) 
The conditions which must be satisfied by ®,, are 
6(0) = X, 


©’'(0) = &’'(0) = --- = 60) = 0. 
The first of these is automatically satisfied. 
Now a function ®,,(y) satisfying these conditions is given immediately by the inverse 
Taylor expansion of X = u(y — y). In fact, if we set #,,(y) equal to the sum of the 
first m terms of this expansion, viz.: 


®,,(y) = > i u(y), 


k=0 


then 
X = oy) +P yr) 
m m! 4 


and hence 


, (—9)” cus 
&,(y) = X — ———u 
(y) = (n) 
satisfies the above conditions. 
This method avoids the necessity of slapping down Bodewig’s formula (14) or of 
motivating it by tedious experimenting with small values of m. 


BOUNDARIES FOR THE LIMIT CYCLE OF VAN DER POL’S EQUATION* 
By R. GOMORY anp D. E. RICHMOND (Williams College) 
1. Introduction. In non-linear mechanics much interest centers on the Van der Pol 
(VDP) equation 
d’x 


2 a ns 
ae te@ -—Dg+2=0 (1) 


*Received Sept. 11, 1950. 
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or its equivalent in the phase plane 


dx _ —— _ 9 
dt = 7, dt — a + u(1 a )y. (2) 


It is well known that (2) possesses a unique trajectory which represents a limit 
cycle in the sense of Poincaré. Using a different plane, La Salle’ has located this limit 
cycle between two boundary curves in a very ingenious manner which however seems 
artificial and difficult to motivate. The present paper sets forth a simple and natural 
method for constructing outer and inner boundaries. The method admits of unlimited 
improvement but even its simplest application gives results superior to La Salle’s in 
that the limit cycle is localized somewhat more sharply. 

In the phase plane all trajectories other than the limit cycle spiral into it from the 
inside or the outside. The curves are described clockwise with increasing ¢. We use these 
facts to enclose the limit cycle between an outer boundary By and an inner boundary B, . 


Introducing r° = x + y’, we transform (2) to 
dr — dx 
r— = pl — r)y —— me wt, 3 
i= Ys a =y (3) 


Eliminating ¢, one obtains for the trajectories 
r— = p(l — z’)y. (4) 


Since the field of (4) is symmetrical in the origin, it is sufficient to discuss solutions in 
the upper half-plane (y 2 0). 

If C is a curve, r = F(x), y 2 0, which intersects the z-axis only at (—a, 0) and 
(a, 0) and if at every point the value of dr/dz for C is greater than or equal to that of 
the VDP solution through that point, all VDP curves intersect C from above to below. 
Then C together with its image in the origin forms an outer boundary B, . 

The construction of an inner boundary B, requires the substitution of less than for 
greater than in the above statement. 

2. The outer boundary. To construct an outer boundary B, , write (4) in the form 


ro = wl — 2°)(r? — 2”), y 209, (4’) 
dx 
On an z-interval within which 1 — 2’ is positive, replace x’ under the radical by 


(x) min » its least value on the interval. The curves defined by the solutions of 


es wl — 2”)[r? — (2") min 


y = 0, (5) 
dx 


. 
have at every point of the interval a value of dr/dx greater than or equal to that of the 
corresponding VDP curve. 

Similarly, on an z-interval within which 1 — 2’ is negative, replace x 
radical by (x”)max , its largest value on the interval. The curves defined by 


2 


under the 


1J, La Salle, Relaration o scillations, Q. App]. Math. 7, 1-19 (1949). If La Salle’s co-ordinates are (zx, u), 
the relation to ours is given by y/u + u = xz — (2/3). His ¢ is » times ours. 
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r% ul— a) — ml, 9 20, 6) 
have at every point of the interval a value of dr/dx greater than or equal to that of the 
corresponding VDP curve. 

It remains to join together solutions of (5) and (6), valid over different intervals, to 
generate a continuous curve which will serve as C, the portion of B, in the upper half- 
plane. 

This boundary, at least for large u, may be expected to lie rather close to the limit 
cycle since over much of the short zx range, x’ is small relative to r’ so that the error 
made by the proposed substitution is not serious. In fact, by using a large number of 
intervals a very accurate outer boundary may be constructed. But even the simplest 
outer boundary, using three intervals, is surprisingly good. We proceed to the details 


for this case. 








Fig. 1. Sketch of Boundaries. 


Let C intersect the z-axis at —a and a. The intervals to be used are [—a, —1], 
[—1, 1] and [1, a]. We start at —a and work across to a in the upper half plane. The 
equations become 


[—a,-1) 2 = — 2"? — 0)”, 
(1,1) rZ=all — 29" - 0", 
[l, a] r x = wl — 2’*)(r — a)”. 


Each of these equations is separable and possesses a general and a singular solution. 
We use the singular solution r = a for [—a, —1] since it is the only solution through 
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(—a, 0) which is real in the interval. For the other intervals, we use the general solutions 


(r? — ¢’)'” : uz — =) +k. 


It is easy to choose the constants k to secure continuity. 
The principal interest is in the amplitude a which determines the size of the en- 


which take the form 


closure. To find a, we take definite integrals successively over the intervals [—a, —1], 
[—1, 1] and [1, al], finding in each case the relation between r at the left side and r at 


the right side of the interval. The results are 


r(—1) -a = 0, 


r(1) — r(—1) = - (7) 


9" ’ 


[r°(1) — a’]'” : (a®* — 3a + 2). 


Combining 
. ‘ Sagi"? 
a —3a+2-4 1+ — 
Zu 


Thus even in the simplest case there is a slight improvement over La Salle’s result, 


which in this form is 


a®* — 3a+2 — a(1 oo 3a) = 0. 


4u 
We quote some numerical results for 4 = 3. La Salle’s boundary gives a = 2.21". 
Our result is a = 2.18~. If seven intervals are used (joining at —3'”, —1.6, —1.4, —1, 
1 and 3'””), one obtains a = 2.107. 
3. The inner boundary. To construct an inner boundary B, , use 
dr ec 1/2 
r = pl — 2x )(r — 2’), y > 0, (4’) 
dx 
as before, but replace x” under the radical by (x*)max if 1 — 2x’ is positive within the 
z-interval, by (x”)mia if 1 — 2” is negative there. 


Using three intervals [—c, —1], [—1, 1] and [1, b], we obtain the equation 


pF = wl - 2°? — 1)", 
applicable to all three intervals. One is tempted to set c = b and reflect in the origin: 
There is however a difficulty. To the left of s = —1, dr/dz is negative but —dr/dx S 1 
for x = —c. This condition restricts the choice of c. 
Since r(— dr/dr) u(a” — 1)(r? — 1)'”, y = O, and since on the z-axis, r = c, it 


follows that 
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The largest c corresponds to the equality. If 

d inate u(d? Fae 1)°”” 
defines d for a given yu, we therefore choose c S d and follow the curve along the upper 
half-plane to its right-hand intersection at b. Reflection of this curve in the origin leaves 
two gaps, one between —b and —c and one between c and b. If b > c we may use seg- 
ments of the z-axis to complete the inner boundary. Since the VDP curves are described 
clockwise with increasing t, these curves cross the added segments in the required 


direction. 
The relation between —c and b is given by integrating between these limits and is 


(b* — 1)7 — @ — 1)” = AC = +c-— “). (9) 


We know that ¢ S d. The largest inner boundary using three intervals arises from the 
choice c = d. However, for the sake of simplicity, we may obtain an inner boundary 
valid for all » by choosing c = 1. Then (9) becomes 


b° — 3b -—2+4+ 3 (42 — 1)'” =90, 
K 


which is a slight improvement over La Salle’s result 


2 4 
m 





be — 3b -—- 2+ 


For » = 3, for example, La Salle’s result is b = 1.77. Ours is b = 1.81 with ec = 1 
and b = 1.87° with c = d = 1.248. If additiona] points of division are placed at —.5, 
5 and 3'”, b = 1.94 is obtained. 

4. Conclusion. In conclusion, it is clear that the simplicity of the calculations makes 
it relatively easy to obtain indefinitely better boundaries by increasing the number of 
intervals. Over these intervals, the form of the solution is (with one exception) always 
the same, different (x)2,. and (x)2,, being inserted. It is therefore sufficient to be armed 
with a table of square roots and cubes to find a or b by trial solution. With a moderate 
amount of work, it is fortunately possible to supplement the method of perturbations 
which is useful for » < 1 and the known results for 4» —~© by giving a good account of 
the limit cycle in the intermediate range of yu. 

It is also obvious that the method of this paper can be applied if x” — 1 in Eq. (1) 
is replaced by other suitable functions f(x). Further generalizations are possible but 


will not be discussed. 
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A THIRD ORDER BOUNDARY VALUE PROBLEM ARISING IN AEROELASTIC 
WING THEORY* 


By GEORGE SEIFERT** (Cornell University) 
The differential equation arising in the problems of chordwise divergence or swept- 


forward wing bending divergence [1] is of the type 


& 


dx 


EI(x) >... Ac(x)z = 0 (la) 
dx 

where \ is a parameter. The boundary conditions are 2(0) = 2’(l) = (EI (x)z’(x))!_, = 0. 
Since this represents a non self-adjoint boundary value problem, the ordinary Rayleigh- 
titz variational method for approximating its characteristic values is not applicable. 
Formal extensions of this variational approach have been suggested by Flax [1] and 
used by Cheng [2] but no mathematical evidence for their validity is at present avail- 
able. In this connection, two questions have been raised [1]. 

(A) Under what conditions on H/(x) and c(x) are all the characteristic values of 
(la) real? 

(B) Under what conditions on EJ (x), c(x) and f(x) can the solution z,(x) of 


[EI (a)z’(x)]’’ — rAc(x)ze(x) = f(x), 2(0) = 2’(l) = (Elz’); = 0 


be expanded in terms of the biorthogonal system of functions arising from the system 
(1a) and its adjoint? 

Although this paper concerns itself chiefly with (A), it may be pointed out that the 
investigations of expansions in terms of characteristic functions of (la) for EJ = 1, 
c(z) = —1,1 = x carried out by L. E. Ward [3] indicate difficulties inherent in such 
irregular boundary-value expansion problems as (B). 

Part I of this paper deals with the characteristic values of the system 

ul’'(x) + p(x)u’(x) + [g(x) + A]u(z) = 0, u(0) = u’(0) = w’’(1) = 0, (1) 
where p and q are real-valued functions analytic on 0 < x < 1. It is found that: 

(a) this system has an infinite number of real characteristic values (Theorem 1); 

(b) if the upper bounds of | p(t) | and | q(t) |, and the positive numbers | p(1) |, 
fo | p(t) | dt, fo | r(t) | dt, where r(t) = g(t) — p’(t), are small enough, then all char- 
acteristic values of (1) are real (Theorem 3). 

Just how small these positive quantities in (b) must be, in order that all the char- 
acteristic values of (1) be real, is not explicitly considered. For specific functions p and 
q, however, the details of the proofs of (a) and (b) enable one to answer this question by 


means of simple numerical computations. 
Much of the method of proof and notation is similar to that used by Ward [4] in 


his study of the system: 
ul’"(x) + [p? + r(x) ]u(xz) = 0, u(0) = u’(0) = u(r) = 0. 


Here r(x) is of a special form which permits Ward to obtain an expansion theorem. It 


*Received Oct. 17, 1950. 
**Now at the University of Nebraska. 
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is hoped that Ward’s approach will also suggest an expansion theorem associated with 
(1) and perhaps also provide an answer to (B). 
Part II indicates a pair of transformations which takes the system 


[fy (OY + Ag@y = 0, y(0) = y’'(0) = y’"(1) = 9, (2) 


where f(t) > 0, g(t) > 0, are real and analytic on 0 < ¢ < 1, over into a system of the 
form of (1). Note that the adjoint of (2) is of the form of (la), and consequently the 
characteristic values of (2), being identical with those of its adjoint, are of interest in 


aeroelastic wing theory. 


PART I 


We consider the system (1). Define 


‘ e /a\ t t wst 
(a) 8(t) = e* te%" +e 
oe wt Wet Wat 
6.(t) = ¢ — sf — Wl 
e it vat 
6.(t) = ¢ — we” — wee" 
where 
: _ 1 es (ri) /3 a —(ri)/3, 
= 1, @W2 = ¢ ) Ws = € ’ 


(b) the complex number p by p* = X, | arg p| < 2/3; 

c) the regions S, and S, of the p plane by 0 < arg p < 2/3 and —7/3 < argp < 0 
respectively. 

Lemma 1. A necessary and sufficient condition that u(x, p) satisfy the equation in 


(1) and u(0, p) = u’(0, p) = 0' ts that 
u(x, p) = 63(px) — (3p°)' [ {5,[o(x — dIr(t) — pd.[o(x — d]p(Djult, p=) dt = (1.1) 


where r(t) = q(t) = p(t). 


The proof of this lemma is completely analogous to that of Theorem 1 of Ward’s 


paper’ and is omitted here. 


Lemma 2. A solution u(x, p) of the equation in (1) such that u(0, p) = u’(0, p) = 0 


18 given by 
u(x, p) = e**”[—w; — we’? *'? 4 2(x, p)] (1.2) 


where | z(x, p) | < m, m being independent of x and p, provided peS, and | p | sufficiently 


lar ge. 

Unless otherwise indicated, the prime will always denote differentiation with respect to the first 
variable. 

2L.. E. Ward, [4], p. 418. 
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Proof. Substituting (1.2) into (1.1) one obtains 


e***[A(x, p) + 2(z, p)] 


pz 


= 6;(px) — (89°) | {ds[o(x — t)Ir(t) — pb.[o(x — A) ]p(}[ACE, p) + 2(t, p)]e**”* dt, 


where A(t, p) = —w; — we exp [(w2 — ws )pt]. Hence 


2(z, p) =e” 
(1.3) 
— 36)" | {aloe — Ir) — pdalole — DIp()}{ACt, 0) + 2lt, ple re-” dt. 
Now for fixed peS, | z(z, p) | attains its upper bound m(p) on 0 < x < 1; hence by (1.3) 
al 


mp) < 1+ (3]p|*)" | | ds[o(x — d]ACE, ple?" Pr(d) | dt 


i 
70 


el 


+B] el)" |] | dale — H(t, peer(s) | at 


“0 


el 
+(3l po)" | | dlole — DIAC, pe" -p(t) | dt 


v0 


1 
+ Blob" | | slole — ket, de" -?p(e | at 


0 


where 
1 1 
r= [ | r(é) | dt, p= [ | p(t) | dé. 
Hence 
Wr | |2 Pom | 
ata < eel tele) 
1—(r/|p | + p/| p}) 
for | p | sufficiently large, from which the lemma follows. 


Theorem 1. There exist an infinite number of real characteristic values X,, for the system 
(1); more precisely, there exists a real d, such that all d, with R(A,) > dr are necessarily 
real.° 


Proof. From (1.1) by differentiation, 


rif 


‘ : ew ; 
u'"(a, p) = p di(pr) — 3 [ {d,[o(z — d]r(t) — pdslo(x — O]p()}ult, p) dt 


“0 


— p(x)u(z, p), 





fz = xz + iy, then R(z) = z defines the notation R(z) which will be used throughout. 
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and the characteristic equation, u’’(1, p) = 0, becomes 


el 


p°8(p) — ; ] {alo — Or — poslo(t — HlpO}AG, o) + a(t, ede" dt 


— p(1)[A(1, p) + 2(1, p)Je**” = 0 
Define 


el 


E(p) = (3p) | { 5,[p(1 pa t)r(t) =~ pb,[p(1 a t)|p(t)} {A(t p) + 2(t, pirore-" dt 


+ p(1)[A(1, p) + 2(1, p)]p™*. 
It is easily verified that the zeros of 6,(p) are the zeros of exp (— 3p/2) + 2 cos [3'p/2] 
and are all real for peS, . 
It will now be shown that | p’5,(p) | > | p’ e*’*E(p) | for p on any one of a set of 
contours C,, in the p plane which are trapezoids whose bases are the segments 








Tv 
?(p) = 2(n + I) 3172 » argp|< 33 
) Tv 
R(p) - 2(n + 2) ai ’ arg p | < 3 ’ 
n=k,k+ 1, --- , k sufficiently large. It is re to show this inequality for the 
upper half of the contour C, ; namely Ci, = C, (\ S, . This is true because u(z, p) is 
real for real p and hence u(z, p*) = [u(z, p)]*, the ae denoting the complex conjugate. 
Hence u'’(x, p*) = [u’’(x, p)]*, and consequently p*[6,(p) — e**’E(p)] takes on values 
on CY’ = ©, (\ S, , the lower half of C, , which are the complex conjugates of those 
taken on on CZ , the upper half. Set F(p) = p’6;(p), G(e) = —p'e**°E(p); then since 


F(p*) + G(p*) = [F(p)]* + [G(p)]* and F(p*) = [F(p)]*, we have G(p*) = [G(p)]*, and 
hence pe°*°E(p) assumes values on C/’ which are the complex conjugates of those it 
takes on on CZ. It clearly suffices, then, to show that 


|e~°*?5,(p) | > | E(p) | for pon C;. (1.4) 
On p = a(1 + 3'7) we have 


| e~°""5,(p) | = | 2e** cos (3'a) + 1 |; 


17, |< rise + m [ I r() | dt = —5(2 + myr, 


where 
al 


I= 33] fexp [ — aol — 0] 
+ exp [(w2 — ws)o(1 — 0] + LILAC, ») + alt, e)Ir(O ats 


[Te] < g-32@+ m) | |p |dt = 5-2 + mp, 
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where 


at 
I, = = | fexp [@, — ws)p(1 — 2] 


— w, exp [(w2 — w3)p(1 — t)] — ws}[A(t, p) + 2(t, p)]p(d) dt; 


| p(1) 


s- (1 + e°°* + m), 
4a 


where 


I; = p(i)[AC, p) + 201, p)]p”. 


( 3 Qi/2, 7] 
|e °°" 6,(p =| expi- > | 2% + 1) gi + | + ~s | 2% + 1) ; + i | 


_ r | 
+ exp 43°°7] 2(n + 1) 3i 5+ 78 f + 1 
91/2 
ia) — 1)" exp & _~ s) 


On p = 2(n+ 1)z a + 18,8 > 0, we have 
) 


} 
; 


bo | Co 


= | exp [—3'(n + 1)z] exp fs 


+ exp (—3'8) + 1 : 


hence | ¢ °6:(p) | > 1 — exp (3'/"r); also 
, =. ae (2 + mjr 
| J, ee ¢ 2 1 902? 
4 [2(n + 1)x]° + 38 
a? (2 + m)p 
| I, < ~ i 


2 {[2(n + 1)r]’ + 3p7}'”’ 


7, | < 3120) _| 1 + exp (—3'"6) + m) | 
= [2(n + 1)r]° + 38° 


Now | E£(p) | < | Li) + | L2| + | Js |, and clearly for | p | large enough there exists 
k such that for n = k,k + 1, «++ , (1.4) holds. We have then | p°6,(p) | > | p% E(p) 
on C,, . Now it is easily verified that there is just one zero of 6,(p) inside C’, . Hence, 
by Rouché’s theorem, there is just one root of the characteristic equation inside C, , 
and since these roots must occur in complex conjugate pairs, this root is real. This 
proves the theorem. 

By a suitable modification of the choice of contours C,, in the proof of the preceding 
theorem, the following result may be obtained. 

Theorem 2. Let the zeros of 5,(p) be denoted by p,.’ such that py. < p\” Koss . Bee 
5 > 0 be given, and define R,, to be the circle | p — p,’ | < 6. Then for k large enough, each 
root p, of the characteristic equation of (1) such that R(p,) > 2kx 3'” lies inside one of 
the circles R, ,n = kik + 1, «++ , there being exactly one p, , necessarily real, in each 


such R,, . 
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The proof will be omitted.‘ 

From the details of the proof of Theorem 1, it may be seen that for r, p, and | p(1) 
small enough, k may be chosen equal zero. The question of whether or not in this case 
all characteristic values of (1) are real, then, reduces to the question of whether or not 
all roots p, of the characteristic equation for which R(p,) < 27/3'” are real. By means 
of the following lemma, we obtain a lower bound on | \, |; e.g., | p, |; and then by means 





of the subsequent theorem, apply the arguments of Theorem 1 to a supplementary 
contour which borders on C, and extends to the region for which no p, can occur. 


Lemma. Let \ be a characteristic value of (1). Then |X|? > 6 — (m, + m.) where 
ip) | < m, | gi) ae Ue 

Proof. Let v(x) = u'(x). Then by (1) 

(x) = —[playu’(x) + g(x)u(x) + dAu(z)], v(0) = v’(1) = 0. 
Hence 

c. b,, Aa, 

v(x) -_ ps + u ot = 

n Kn 


n(x) (1.5) 


where ¢,(x) and pu, are the characteristic functions and numbers respectively of the 


system 


¢’'(x) + ud(x) = 0, ¢(0) = ¢’(1) = 0 
and 


al 1 
a. ul(t)d,(t) dt, b, = | u(t)q(t)¢,(t) dt, Cc, = [ u’(t)p(b¢,(t) dt. 
/0 70 “0 
Hence by (1.5) and integration from 0 to x, one obtains 


Cc b ei 
>> - a Se 2,(2) where z,(2) = | d,(t) dt. 
, Kn “0 


u(a) 
Multiplying by ¢,(x) and integrating from 0 to 1 one obtains 


al al 

c, + b, + dra, ; 

a. = o,(x)u(x) dx = s he Z| zn(x)p,(x) dx. 
/0 n Mn 7) 

Hence, using Schwarz’s inequality, 


2 








la? SMV (la P+] +I1A/ | asf) | / Zn(x)o.(x) dx 


where 
l l 
M = =5=- +: 
mie} 
Summing over k and using Parseval’s theorem: 


TlarswCdaht lat iarlal Z| f eele) de 


x 


2 





(1.6) 
=MY(ahtluP +A lal) [ la@ Pade. 


*See L. E. Ward, [4] pp. 419-420. 
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However, since 


[ | z,(z) |? dx < if | b,(t) |” as} dz = 1, 


“0 
and since we may assume 


1 
/ | u(x) |? dx = 1, 


another application of Parseval’s theorem to (1.6) gives 


1 


al , 
i= uh | | p(t)u’(t) |? dt + 


J( 


| g(t)u(t) |’ dt + | a rh (1.7) 
Let | p(t) |? < m, , | q(t) |? < me. ; then from (1.7) one has 


L< um, [ | w(t) | dt + m. + ]A Ph. (1.8) 


Multiplying (1.5) by ¢,(x) and integrating from 0 to 1, one obtains 


| o,(x)u'(x) dx | = - _<—_— 3 
/0 | Me | ME 


loth tru |? - lo +1 & |? +1017] ao |? 


and summing over k and applying Parseval’s theorem again: 


al pl 


| | u(x) |? dx < um, | 


v0 70 


| w’(x) |? dx + m+ ]A °), 
from which it follows that 
1 
i= Ma) / | w(2) |? dx < M(m, + |r). (1.9) 


Hence, by (1.8) and (1.9): 
(1 — Mm,) < M[m,M(m, + | |?) + (m. + | |?)(1 — Mm,)] = M(m, + | }’). 


From this one obtains 


1 — M(m, + m.) , . 
M < | A | ° 





Since M = 1/6, the proof of the lemma is complete. 

Theorem 3. Let p, r, m, , Mz be defined as in previous theorems and lemmas. Then if 
these constants and | p(1) | are sufficiently small, all characteristic values of the system (1) 
are real. 

Proof. It has already been noted that for sufficiently small values of p, 7, and | p(1) |, 
k may be chosen equal zero. We now consider the supplementary contour C_, , a trape- 
zoid whose bases are the segments: 


2 
R(p) = 37 


A 





Tv 
argp | <3; 


R 
= | ar <-:; 
Ro) ==, — | arg | <=; 
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where R = [6 — (m, + m,)]'’°. By Theorem 1 and the lemma all non-real characteristic 
values \,, = p2 must be such that p, is in or on C_, . However, for sufficiently small p, 
r, mM; , m, , and | p(1) |, the argument used to prove Theorem 1 is applicable to C_, 
and we conclude that there is just one root p_, , necessarily real, inside C_, . This proves 


the theorem. 


PART II 


Consider the system 
[f(y + Agdy) = 90, yO) = y'0) = yy’) = 9, (2) 
f and g real and analytic on 0 < ¢ < 1, f(t) > 0, g(t) > 0. The transformation 


t ° 1/3 1 
r= } / h(s) ds where h(s) = Fa , d= i h(s) ds, 


has an inverse t = ¢(x). This inverse transforms (2) into 


2 


7. y[t(x)] + p,(2) os y[t(x)] + pe(z) f. y[t(x)] + ylt(x)] = 0, ; 
2.1) 
y(0) = y’(0) = y'"(1) = 9, 
where 
p(x) = , {3h (Ca) nL) + = nla) 
p(x) = , {ne teay ete] ao em ww), 


the notation f’[t(x)] meaning that the substitution ¢ = ¢(x) has been made in the func- 


tion of ¢: f’(d). 


Next, the substitution 


y[t(x)] = u(x) exp E 


Ole 


d 


[" pu) as 
0 
and subsequent multiplication by 


exp E [ p(s) as| 


reduces (2.1) to u’’”’(x) + p(x)u’(x) + [q(x) + AJu(xz) = 0, u(0) = u’(0) = u”(1) = 0, 
where 


r(x : 
p(x) = p(x) — P oo — pi(x), 


(x) — PL@) _ Pil@)prl@) | 
; 3 


q(x) 3 


— Ss 
Q7 


Hence the results of Part I apply to the system (2). 
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Noting the structure of p and q; e.g., p, and p, , in terms of f and g, we state an obvious 
corollary of Theorem 3. 


> 


Corollary. Let | f'()| < 6 |g" @®| < €forn = 1, 2, 3, 4. Then for « sufficiently 
small, all the characteristic values of (2) are real. 


It is also clear from the structure of p and q that less restrictive, although perhaps 
more complicated, conditions on f and g than those in the hypothesis of the above 
corollary will yield the same conclusion. 

The author wishes to thank Professors W. Feller and W. R. Sears for suggesting the 
problem and for helpful suggestions toward its solution. 
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ON AN EQUATION OCCURRING IN THE HARMONIC ANALYSIS OF 
VISCOUS FLUID FLOW* 


By RICHARD BELLMAN (Stanford University) 
1. Introduction. It was shown by J. Kampé de Feriet’ that the Fourier transform 
I ° - 
2(w, , Ww. , t) = i ¢(x, y) exp [—7(w,x + w.y)] dx dy (1) 
én” Ju 
of the vorticity, ¢(z, y), associated with the two-dimensional flow of an incompressible 


fluid extending over the entire (z,y)-plane, under mild conditions, satisfies the non- 
linear integro-differential equation 


S aw, ; We ’ t) _ —v(w; + w,)2(W, ’ We ] t) 
(2a) 
r=  [0,w. — Ow . 
+ | | CO +e (0, , 0. , O2(0, + w, , 0 + w., t) dé, dé, 
and the boundary condition 
z(w, , We, 0) = o(w,; , We). (2b) 


*Received November 8, 1950. The results contained in this paper were obtained in connection with 
research sponsored by the Rand Corporation. 
1J, Kampé de Feriet, Harmonic analysis of the two-dimensional flow of an incompressible viscous fluid, 


Q. Appl. Math. 6, 1-13 (1948). 
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It seems quite difficult to determine the properties of the solution of 2a for general 
@. Consequently, it may be of interest to indicate the following theorem which is an 
analogue of the well-known result of Poincaré and Liapounoff* concerning the stability 
of solutions of non-linear ordinary differential equations. 
TuroriM: Jf Max, | ¢(w, , Ww.) | is sufficiently small, there is a solution to (2a) and 
(2b) which is unique, and satisfies the inequality 
| e(w, , 0 , ) | < 2 Max [olen . ws) | (3) 
: ~ [1 + v(w; + wth’ 
for all w, , w, andt > 0. 
From this we conclude that the solution z = 0 of (2a) is stable. 





By the expression “‘sufficiently small’? we mean that there exists a constant c = 
c(v) with the property that Max |¢| < c suffices to establish (3). The constant c may 
be determined from the analysis below. However, we feel that the value of c obtained 
in the course of our proof has no particular significance. At the expense of decreasing c 
we can replace the exponent 2 on the right side of (3) by any arbitrary n. 

While the general method, namely that of applying the technique of successive 
approximations, is standard, the details are not as simple as might be believed upon 
first viewing the equation. It might be expected that in place of (3) one could assert 


| 2(w, , w. , t) | < ¢, Max | o(w, , w.) | exp [—v(w; + w)é], (4) 


provided that, as above, Max, | ¢(w, , w.) | is sufficiently small. This result seems 
difficult to prove, and it is quite possible that it is not true. 


2. Proof of theorem. From (2) we obtain, assuming for the moment that the solution 


exists, 


(1) 


= ¢(w, , We) exp [—v(w, + w)t] + [ exp [—v(wi + w:)(t — t)]J(2) dt, , 


a 


where we have set 


ar (6,W. — 6.W,) — 
J(z) = 2 I / , we rs w 2(0, + w, , 0 + we, b2(0, , 6, t) dd, dé. 


This equation is solved by the method of successive approximations, by means of 
the algorithm 
Zo = o(w, , w) exp [—v(wy + w5)t] 
(3) 


“n+1 “0 


2.21 = Sot [ exp [—v(wi + w>)(t — t)]J(z,) dt; , n= 0,1, --- 


“0 


The first step of the proof consists of showing that the sequence {z,} is uniformly 
I I £ 1 Lenj } 


bounded by an appropriate function of w, , w, and t, namely 
es oe 
"' = [1 + v(w, + wt)’ 


R. Bellman, On the boundedness of solutions of non-linear differential and difference equations, Trans. 


“ik. 


Amer. Math. Soc. 62, 357-86 (1947). 
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for all w, , w. , t, where we have set, for the sake of convenience, 
a = Max | ¢(u, , wv) |. (5) 
i 
Throughout we shall use, without further mention, the following simple inequalities, 


(a) e* < 1/1 +2), x>0 





(6) 
(b) a/b < (1 + az)/(1 + bz) < 1, b>a>Q0, z> 0. 
We turn now to the proof of (4). The result is clear for n = 0, since 
zy | < Max | ¢ | exp [—v(w; + w;)t] < ———— = 5 
lz |S u ia iaiiiin a vs [1 + v/2(w, + w,)t]" (7) 


lA 


fa/(1 + v(w, + w.)é]’. 

To treat the general case, we proceed by induction, assuming that (4) holds for n = 
0, 1, --- , N, and then proving it for N + 1. We first require upper bounds for J (zy). 
Introducing polar coordinates, 

6, = R cos y, w, = 7r cos 0 


(8) 
6. = Rsin jy, W. 


Il 


rsin @, 
J(2y) = 2r | | | sin (W — O)zy(O, , 02, DZv(O, +w,,0+w,,)dRdy. (9) 


v0 


Applying our inductive hypothesis, 








pe pte | 
) | < 128a°r <5 SST ST x3 | dR 
| J@w) | S 12801 i i F + vR*t)”-[1 + vi(R° + 2Rr cos (6 — yp) + a | ak oy 
(10) 
Pan ita dR 
< ¢,0° a 
= oar i F + vRt)*[1 + (Rk — | 
where c, = 2567. 
This last integral is now split into three parts, 
ac r/2 a2r ) 
C,r ty [ + cf | + qr [ =J,t+J.+43, (11) 
which we discuss separately. We have, since r — R > r/2for0 < R < 17/2, 
Gr ers dR 16c,r “ dR 
‘ < 7 ——— > ax an ree nn ae > ea ee ,..o ck on ae 
Ws (1 + or t/4)° I (1 + ort)” (1 + ort)” i (1 + vR*t)’ 
(12) 
i a oe 
t'’? (1 + or°t)’’ 


where 


c= 16c, | ds/(1 + 0s)’. 


“0 
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Turning to J, , we obtain 


he » 1 24¢,r° 
>< Pr ae 8 Se: 
Jas 2” (1 + or°t/4)? — (1 + ort)” (13) 

Finally, since R — r > rfor2r< R <o@, 


_¢M f"_ dR coy 1 
Ja S (1 + vr°t)? Jo, (A+ vR*t)? < (14) 


where 


C=C, i: ds/(1 + vs’)’. 


0 
Collecting the results, 


| J (zy) | < a’(J, + J2+ J 3) 


< | ota 1 4 24¢,r° ] (15) 
>a ti? (I + vr’t)’ (1 + ort)? . 





Applying these inequalities to (3), the result is 





leven |< leo] + [exp [—w%(t — 4] | Jew) | dt 
—— se 2 ‘ exp [—vr"(t — t,)] dt, 
$ (1 + ort)” bila elie I (1 + vr*t,)*(t,)'” (16) 


oa f' exp [—ur*(t — t,)] dt, 
4. 24c,a r [ (1 + vr’t,)* ° 


0 





The first integral may be written 





t t/2 t 
= oh +k. 1 
rf rf +r] : oe (17) 
Then 
_ f? exp [-or'(t — t)) _dt_ a os dt, 
her | +o)? Gy? S exp [or e/2lr I (1 + ort)” 


(18) 





] [ dt, Ca 
a ' —— . 
> + 0rt/4?" J, 1 + ort) ()” = 1 + ort)?’ 


where 
c, = 16 [ ds/(1 + vs)*s'”’. 


The second integral leads to 
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exp [—or"(t — t,)] dt, LV r [ " dt, 
le =r “J ; 5 ; xp [—or(t — 75 
dle | +r)? = 0 +o t/2) J 2 ih il il wy” 
< 4r _ [ * exp [—vr"t,] dt, c 4r ~— [ exp [—vr' t,] dt, (19) 
= (1 + vr’) (¢-—4)'" —(1+or't? J, te 
C- 
(1 + vr°t) 
where 


The second integral in (16) is broken up in like fashion into J; , the integral over 
[0, ¢/ 2], and J, , the integral over [t/2, ¢]. The first integral satisfies the inequality 


dt, c 


I, <r’ exp [—vr’t/2 —3 < "io, 20 
Sr exp(—» 1 (1 + 0r°t,)° — A +0rt”’ (20) 
where 
c, = 16 | ds/(1 + vs)’*, 
while the second satisfies the inequality 
I, < c;/(1 + or’ d)’, (21) 
with c, = 1/r. Collating these results, we obtain 
ea tae hee Wis > tds +0) + Sal 4 00 
—~ (1 + ort) (l + ort)" oe 
(22) 
fe 
S rad 9 
—~ (1 + ort) 
provided that 
a < 2/[(co + ¢3)(c, + €5) + 24c,(cs + C7)]. (23) 


This completes the induction. 
We must now show that zy converges to a solution of the original functional equation. 

In the usual manner, this is accomplished by demonstrating the uniform convergence 

of the series )>*., (2,.: — 2,). From (2) we obtain 

Cyr 


(1 + ort)” t) 


2-2 |< 2 | exp [—v(w, + w2)(t — t)] | Jo) | dt, < 
It now follows by induction, using the same procedure as above, that there exists a 
constant ¢, such that 


(c. \N+1 
< CoQ) 


2n+1 — 2n | &S eS ort)? ; 
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Hence if 


a < Min [1/e, , 1/(e, + 5; +c, + €)], (26) 


we have uniform convergence of zy to a function z over the entire (w, , w,) plane and 
the infinite ¢-interval, 0 < t <o. It follows from the uniform convergence that we can 
pass to the limit as N —©@ under the integral sign in (3), obtaining (1). Differentiation 
of (1) vields the original equation. 

The uniqueness is now established in the standard fashion. 


BOOK REVIEWS 


Table of the Bessel functions Y,(z) and Y,(z) for complex arguments. Prepared by the 
Computation Laboratory, National Bureau of Standards. Columbia University Press, 
New York, 1950. xi + 427 pp. $7.50. 


This volume supplements the earlier volume of tables of Jo(z) and Ji(z) for complex arguments [see 
Q. of Appl. Math. 2, 276 (1944) and 6, 95 (1948)]. The main tables give Yo(pe'®) and Yi(pe'*) to ten 
decimal places for p = 0(.01)10 and g¢ = 0(5°)90°. Auxiliary tables give Yo(pe'*®) — (2/2)Jo(pe'*) log p 
and Yi(pe'*) — (2/x)Ji:(pe**) log p + (2/mp)e~'*, the complex zeros of Bessel functions, and five-point 


Lagrangian interpolation coefficients. 
W. PRAGER 


The inelastic behavior of engineering materials and structures. By Alfred M. Freudenthal. 
John Wiley & Sons, Inc., New York and Chapman & Hall, Limited, London, 1950. 
xvi +587 pp. $7.50. 


The amazing scope of the book and its detailed coverage of so many facets of inelastic action bear elo- 
quent testimony to the author’s wide-spread reading and his own research. Quantum statistics, conven- 
tional metallurgy, mathematical theories of plasticity, visco-elasticity, stress analysis solutions, and design 
criteria for metals and concrete are all presented from a unified and extremely interesting point of view. 
The reader is made to feel equally familiar with electron clouds, simple and complex mechanical models of 
the behavior of real materials, Brownian motion, limit design, and testing machines. 

The only objection to be noted is that little indication is given at the highly controversial nature of 
the field. Opinions are often stated as facts. For example, this reviewer believes that much of the material 
on thermodynamics and the mechanical equation of state is based on demonstrably over-simple and prob- 
ably incorrect assumptions about the dissipated work. However, read with an open and skeptical mind 
the book is invaluable. 


D. C. DruckER 


Electromagnetic fields. Theory and application. Volume I: Mapping of Fields. By Ernst 
Weber. John Wiley & Sons, Inc., New York and Chapman & Hall Limited, London, 
1950. xiv + 590 pp. $10.00. 


The author has divided electromagnetic theory into static electric and magnetic fields on one hand 
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and dynamic electromagnetic fields on the other hand. This book (Volume I) deals with static electric 
and magnetic fields or in other words with the methods of potential theory. The list of contents: (1) The 
Electrostatic Field, (2) The Magnetostatic Field, (3) General Field Analogies, (4) Fields of Simple 
Geometries, (5) Experimental Mapping Methods, (6) Field Plotting Methods, (7) Two-Dimensional 
Analytic Solutions, (8) Three-Dimensional Analytic Solutions and Appendices. 

The book is very carefully written throughout and the reviewer feels that particular attention should 
be called to Chapter 7 which contains an excellent discussion of conformal transformations and mapping. 
Chapter 6 on field plotting methods contains a rather thorough discussion of the method of images both 
electric and magnetic. Under numerical methods of field plotting the author includes a discussion of 
relaxation methods, 

Altogether the reviewer feels that this book should be of real interest to a rather large group of 


physicists, engineers, and applied mathematicians. 
Roun TRUELL 


The evolution of scientific thought. From Newton to Einstein. By A. d’Abro. Second edition, 
revised and enlarged. Dover Publications, Inc., 1950. xx + 481 pp. $3.95. 


This ‘‘semi-popular book”’, using “non-technical language’’ (?) hopes to “serve as a general intro- 
duction” to the theory of relativity, “to whet the appetite for further knowledge” (the quotes are from 
Author’s Preface). Its four parts deal with pre-relativity physics, the special theory of relativity, the 
general theory of relativity, and the methodology of science. We already have half-a-dozen short and 
masterly treatments of the same field by Einstein, Weyl and Eddington, and this one suffers badly in 
comparison. This very long book has seemed consistently boring to this reader. As an introduction to 
serious work it is far from helpful, as it contains no index, no bibliography, and quotations are made 
throughout without page reference. There are fifteen portraits of mathematicians (without indication of 


origin) but the pleasure the reader should have in looking at them is completely spoiled by fourteen of 
them being crowded together at the center of the book. It is to be hoped that this publisher’s experiment, 
which no doubt made life easier for the make-up man, will not be repeated. 


P. Le CorBEILLER 
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